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leaves, roots, fruits and seeds. Nucleic acid sequences and constructs encoding fatty 
acid desaturases, including A5-desaturases, A6-desaturases and A 12-desaturases, 
are used to generate transgenic plants, plant parts and cells which contain and 
express one or more transgenes encoding one or more desaturases. Expression of the 
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acid, eicosapentaenoic acid, a-linolenic acid, gamma-linolenic acid, arachidonic 
acid and the like for modification of the fatty acid profile of plants, plant parts 
and tissues. Manipulation of the fatty acid profiles allows for the production of 
commercial quantities of novel plant oils and products. 
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METHODS AND COMPOSITIONS FOR SYNTHESIS OF 
LONG CHAIN POLYUNSATURATED FATTY ACIDS IN PLANTS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of USSN 08/834,655, filed 
5 April 1 1, 1997, and a continuation in part of USSN 08/833,610, Filed April 1 1, 
1997, USSN 08/834,033 filed April 11, 1997 and USSN 08/956,985 filed 
October 24, 1997 which disclosures are incorporated herein by reference. 

INTRODUCTION 

Field of the Invention 

10 This invention relates to modulating levels of enzymes and/or enzyme 

components capable of altering the production of long chain polyunsaturated 
fatty acids (PUFAS) in a host plant. The invention is exemplified by the 
production of PUFAS in plants. 

Background 

15 Two main families of polyunsaturated fatty acids (PUFAs) are the co3 

fatty acids, exemplified by arachidonic acid, and the co6 fatty acids, exemplified 
by eicosapentaenoic acid. PUFAs are important components of the plasma 
membrane of the cell, where they may be found in such forms as phospholipids. 
PUFAs also serve as precursors to other molecules of importance in human 

20 beings and animals, including the prostacyclins, leukotrienes and 

prostaglandins. PUFAs are necessary for proper development, particularly in 
the developing infant brain, and for tissue formation and repair. 

Four major long chain PUFAs of importance include docosahexaenoic 

acid (DHA) and eicosapentaenoic acid (EPA), which are primarily found in 

25 different types of fish oil, gamma-linolenic acid (GLA), which is found in the 

seeds of a number of plants, including evening primrose (Oenothera biennis), 

borage (Borago officinalis) and black currants (Ribes nigrum), and stearidonic 

acid (SDA), which is found in marine oils and plant seeds. Both GLA and 

another important long chain PUFA, arachidonic acid (ARA), are found in 
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filamentous fungi. ARA can be purified from animal tissues including liver and 
adrenal gland. 

For DHA, a number of sources exist for commercial production 
including a variety of marine organisms, oils obtained from cold water marine 
5 fish, and egg yolk fractions. For ARA, microorganisms including the genera 
Mortierella, Entomophthora, Phytium and Porphyridium can be used for 
commercial production. Commercial sources of SDA include the genera 
Trichodesma and Echium. Commercial sources of GLA include evening 
primrose, black currants and borage. However, there are several disadvantages 

10 associated with commercial production of PUFAs from natural sources. Natural 
sources of PUFAs, such as animals and plants, tend to have highly 
heterogeneous oil compositions. The oils obtained from these sources therefore 
can require extensive purification to separate out one or more desired PUFAs or 
to produce an oil which is enriched in one or more PUFA. Natural sources also 

1 5 are subject to uncontrollable fluctuations in availability. Fish stocks may 
undergo natural variation or may be depleted by overfishing. Fish oils have 
unpleasant tastes and odors, which may be impossible to economically separate 
from the desired product, and can render such products unacceptable as food 
supplements. Animal oils, and particularly fish oils, can accumulate 

20 environmental pollutants. Weather and disease can cause fluctuation in yields 
from both fish and plant sources. Cropland available for production of alternate 
oil-producing crops is subject to competition from the steady expansion of 
human populations and the associated increased need for food production on the 
remaining arable land. Crops which do produce PUFAs, such as borage, have 

25 not been adapted to commercial growth and may not perform well in 

monoculture. Growth of such crops is thus not economically competitive where 
more profitable and better established crops can be grown. Large scale 
fermentation of organisms such as Mortierella is also expensive. Natural 
animal tissues contain low amounts of ARA and are difficult to process. 

30 Microorganisms such as Porphyridium and Mortierella are difficult to cultivate 
on a commercial scale. 

-2- 
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Dietary supplements and pharmaceutical formulations containing 
PUFAs can retain the disadvantages of the PUFA source. Supplements such as 
fish oil capsules can contain low levels of the particular desired component and 
thus require large dosages. High dosages result in ingestion of high levels of 
5 undesired components, including contaminants. Care must be taken in 

providing fatty acid supplements, as overaddition may result in suppression of 
endogenous biosynthetic pathways and lead to competition with other necessary 
fatty acids in various lipid fractions in vivo, leading to undesirable results. For 
example, Eskimos having a diet high in co3 fatty acids have an increased 
10 tendency to bleed (U.S. Pat. No. 4,874,603). Unpleasant tastes and odors of the 
supplements can make such regimens undesirable, and may inhibit compliance 
by the patient. 

A number of enzymes are involved in PUFA biosynthesis. Linoleic acid 
(LA, 1 8:2 A9, 12) is produced from oleic acid (1 8: 1 A9) by a A12-desaturase. 

15 GLA (18:3 A6, 9, 12) is produced from linoleic acid (LA, 18:2 A9, 12) by a A6- 

desaturase. ARA (20:4 A5, 8, 1 1, 14) production from DGLA (20:3 A8, 1 1, 14) ; 
is catalyzed by a A5-desaturase. However, animals cannot desaturate beyond 
the A9 position and therefore cannot convert oleic acid (18:1 A9) into linoleic 
acid (18:2 A9, 12). Likewise, ct-linolenic acid (ALA, 18:3 A9, 12, 15) cannot 

20 be synthesized by mammals. Other eukaryotes, including fungi and plants, have 
enzymes which desaturate at positions A21 and A15. The major poly- 
unsaturated fatty acids of animals therefore are either derived from diet and/or 
from desaturation and elongation of linoleic acid (18:2 A9, 12)or oc-linolenic 
acid (18:3 A9, 12, 15). 

25 Poly-unsaturated fatty acids are considered to be useful for nutritional, 

pharmaceutical, industrial, and other purposes. An expansive supply of poly- 
unsaturated fatty acids from natural sources and from chemical synthesis are not 
sufficient for commercial needs. Therefore it is of interest to obtain genetic 
material involved in PUFA biosynthesis from species that naturally produce 

30 these fatty acids and to express the isolated material alone or in combination in 
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a heterologous system which can be manipulated to allow production of 
commercial quantities of PUFAS. 

The present invention is further directed to formulas, dietary 
supplements or dietary supplements in the form of a liquid or a solid containing 
5 the long chain fatty acids of the invention. These formulas and supplements 
may be administered to a human or an animal. 

The formulas and supplements of the invention may further comprise at 
least one macronutrient selected from the group consisting of coconut oil, soy 
oil, canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
1 0 whey, electrodialysed skim milk, milk whey, soy protein, and other protein 
hydrolysates. 

The formulas of the present invention may further include at least one 
vitamin selected from the group consisting of Vitamins A, C, D, E, and B 
complex; and at least one mineral selected from the group consisting of 
1 5 calcium, magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

The present invention is further directed to a method of treating a patient 
having a condition caused by insuffient intake or production of polyunsaturated 
fatty acids comprising administering to the patient a dietary substitute of the 
20 invention in an amount sufficient to effect treatment of the patient. 

The present invention is further directed to cosmetic and pharmaceutical 
compositions of the material of the invention. 

The present invention is further directed to transgenic oils in 
pharmaceutically acceptable carriers. The present invention is further directed 
to nutritional supplements, cosmetic agents and infant formulae containing 
transgenic oils. 

The present invention is further directed to a method for obtaining 
altered long chain polyunsaturated fatty acid biosynthesis comprising the steps 
of: growing a microbe having cells which contain a transgene which encodes a 
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transgene expression product which desaturates a fatty acid molecule at carbon 
5,5 or 12 from the carboxyl end of said fatty acid molecule, wherein the 
trangene is operably associated with an expression control sequence, under 
conditions whereby the transgene is expressed, whereby long chain 
5 polyunsaturated fatty acid biosynthesis in the cells is altered. 

The present invention is further directed toward pharmaceutical 
compositions comprising at least one nutrient selected from the group consisting 
of a vitamin, a mineral, a carbohydrate, a sugar, an amino acid, a free fatty acid, 
a phospholipid, an antioxidant, and a phenolic compound. 

10 Relevant Literature 

Production of gamma-linolenic acid by a A6-desaturase is described in 
USPN 5,552,306 and USPN 5,614,393. Production of 8, 1 1-eicosadienoic acid 
using Mortierella alpina is disclosed in USPN 5,376,541. Production of 
docosahexaenoic acid by dinoflagellates is described in USPN 5,407,957. 

1 5 Cloning of a A6-desaturase from borage is described in PCT publication WO 
96/21022. Cloning of A9-desaturases is described in the published patent 
applications PCT WO 91/13972, EP 0 550 162 Al, EP 0 561 569 A2, EP 0 644 
263 A2, and EP 0 736 598 Al, and in USPN 5,057,419. Cloning of A12- 
desaturases from various organisms is described in PCT publication WO 

20 94/1 1516 and USPN 5,443,974. Cloning of A15-desaturases from various 

organisms is described in PCT publication WO 93/1 1245. A A6 palmitoyl-acyl 
carrier protein desaturase from Thumbergia alata and its expression in E. coli is 
described in USPN 5,614,400. Expression of a soybean stearyl-ACP desaturase 
in transgenic soybean embryos using a 35S promoter is disclosed in USPN 

25 5,443,974. 

SUMMARY OF THE INVENTION 

Novel compositions and methods are provided for preparation of poly- 
unsaturated long chain fatty acids and desaturases in plants and plant cells. The 
methods involve growing a host plant cell of interest transformed with an 
30 expression cassette functional in a host plant cell, the expression cassette 
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comprising a transcriptional and translation^ initiation regulatory region, joined 
in reading frame 5' to a DNA sequence encoding a desaturase polypeptide 
capable of modulating the production of PUFAs. Expression of the desaturase 
polypeptide provides for an alteration in the PUFA profile of host plant cells as 
5 a result of altered concentrations of enzymes involved in PUFA biosynthesis. 
Of particular interest is the selective control of PUFA production in plant tissues 
and/or plant parts such as leaves, roots, fruits and seeds. The invention finds 
use for example in the large scale production of DHA, EPA, ARA, and GLA 
and for modification of the fatty acid profile of edible plant tissues and/or plant 
1 0 parts. 

The present invention further includes a purified nucleotide sequence or 
polypeptide sequence that is substantially related or homologous to the 
nucleotide and peptide sequences presented in SEQ ID NO:l - SEQ ID NO:52. 
The present invention is further directed to methods of using the sequences 
1 5 presented in SEQ ID NO: 1 to SEQ ID NO:40 as probes to identify related 

sequences, as components of expression systems and as components of systems 
useful for producing transgenic oil. 

BRIEF DESCRIP TION OF THE DRAWINGS 

Figure 1 shows possible pathways for the synthesis of arachidonic acid 
20 (20:4 A5, 8, 11, 14) and stearidonic acid (18:4 A6, 9, 12, 15) from palmitic acid 
(Cj 6 ) from a variety of organisms, including algae, Mortierella and humans. 
These PUFAs can serve as precursors to other molecules important for humans 
and other animals, including prostacyclins, leukotrienes, and prostaglandins, 
some of which are shown. 

25 Figure 2 shows possible pathways for production of PUFAs in addition 

to ARA, including EPA and DHA, again compiled from a variety of organisms. 

Figure 3A-E shows the DNA sequence (SEQ ID NO:l) of the 
Mortierella alpina A6 desaturase and the deduced amino acid sequence (SEQ 
ID NO:2). 
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Figure 4 shows an alignment of the Mortierella alpina A6 desaturase 
amino acid sequence with other A6 desaturases and related sequences (SEQ ID 
NOS:7, 8, 9, 10, 11, 12 and 13). 

Figure 5A-D shows the DNA sequence of the Mortierella alpina A12 
5 desaturase (SEQ ID NO:3) and the deduced amino acid sequence (SEQ ID 
NO:4) 

Figure 6 shows the deduced amino acid sequence (SEQ ID NO: 14) of 
the PCR fragment (see Example 1). 

Figure 7A-D shows the DNA sequence of the Mortierella alpina A5 
1 0 desaturase (SEQ ID NO:5). 

Figure 8 shows alignments of the protein sequence of the A5 desaturase 
(SEQ ID NO:6) with A6 desaturases and related sequences (SEQ ID NOS:15, 
16,17,18). 

Figure 9 shows alignments of the protein sequence of the Ma 29 and 
15 contig 253538a. 

Figure 10 shows alignments of the protein sequence of Ma 524 and 
contig 253538a. 

BRIEF DESCRIPTION OF THE SEQUENCE LISTINGS 
SEQ ID NO:l shows the DNA sequence of the Mortierella alpina A6 
20 desaturase. 

SEQ ID NO:2 shows the amino acid sequence of the Mortierella alpina 
A6 desaturase. 

SEQ ID NO:3 shows the DNA sequence of the Mortierella alpina A12 
desaturase. 

25 SE Q ID NO:4 shows the amino acid sequence of the Mortierella alpina 

A 12 desaturase. 
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SEQ ID NO:5 shows the DNA sequence of the Mortierella alpina A5 
desaturase. 

SEQ ID NO: 6 shows the amino acid sequence Mortierella alpina A5 
desaturase. 

5 SEQ ID NO:7 - SEQ ID NO: 13 show amino acid sequences that relate 

to Mortierella alpina A6 desaturase. 

SEQ ID NO: 14 shows an amino acid sequence of a PCR fragment of 
Example 1. 

SEQ ID NO: 15 - SEQ ID NO: 18 show amino acid sequences that relate 
1 0 to Mortierella alpina A5 and A6 desaturases. 

SEQ ID NO: 19 - SEQ ID NO:30 show PCR primer sequences. 

SEQ ID NO:31 - SEQ ID NO:37 show human nucleotide sequences. 

SEQ ID NO:38 - SEQ ID NO:44 show human peptide sequences. 

SEQ ID NO:45 - SEQ ID NO:46 show the nucleotide and amino acid 
1 5 sequence of a Dictyostelium discoideium desaturase. 

SEQ ID NO:47 - SEQ ID NO:50 show the nucleotide and deduced 
amino acid sequence of a Schizochytrium cDNA clone. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In order to ensure a complete understanding of the invention, the 
20 following definitions are provided: 

A5-Desaturase: A5 desaturase is an enzyme which introduces a double 
bond between carbons 5 and 6 from the carboxyl end of a fatty acid molecule. 

A6-Desaturasc: A6-desaturase is an enzyme which introduces a double 
bond between carbons 6 and 7 from the carboxyl end of a fatty acid molecule. 

25 A9-Desaturase: A9-desaturase is an enzyme which introduces a double 

bond between carbons 9 and 10 from the carboxyl end of a fatty acid molecule. 

-8- 
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A12-Desaturase: A12-desaturase is an enzyme which introduces a 
double bond between carbons 12 and 13 from the carboxyl end of a fatty acid 
molecule. 

Fatty Acids: Fatty acids are a class of compounds containing a long 
5 hydrocarbon chain and a terminal carboxylate group. Fatty acids include the 
following: 



Fatty Acid 


12:0 


lauric acid 




16:0 


palmitic acid 




16:1 


palmitoleic acid 




18:0 


stearic acid 




18:1 


oleic acid 


A9-18:l 


18:2 A5,9 


taxoleic acid 


A5,9-18:2 


18:2 A6,9 


6,9-octadecadienoic acid 


A6,9-18:2 


I O.J. 


linoleic acid 


A9,12-18:2 (LA) 


18:3 A6,9,12 


gamma-linolenic acid 


A6,9,12-18:3 (GLA) 


i o.i Ac n i ^ 

18:3 A5,9,12 


pinolenic acid 


A5,9,12-18:3 


1 Q*l 
1 5.3 


alpha-linolenic acid 


A9, 12, 15- 18:3 (ALA) 


18:4 


stearidonic acid 


A6,9,12,15-18:4 (SDA) 


20:0 


Arachidic acid 




20:1 


Eicoscenic Acid 




22:0 


behehic acid 




22:1 


erucic acid 




22:2 


Docasadienoic acid 




20:4 0)6 


arachidonic acid 


A5,8, 11,14-20:4 (ARA) 


20:3 o>6 


o>6-eicosatrienoic 
dihomo-gamma linolenic 


A8,l 1,14-20:3 (DGLA) 


20:5 o>3 


Eicosapentanoic 
(Timnodonic acid) 


A5,8, 1 1 , 14, 1 7-20:5 (EPA) 


20:3 0)3 


o)3-eicosatrienoic 


Al 1,16,17-20:3 


20:4 0)3 


o)3-eicosatetraenoic 


A8, 11,14,17-20:4 


22:5 0)3 


Docosapentaenoic 


A7,10,13,16,19-22:5 (o>3DPA) 


22:6 o)3 


Docosahexaenoic 
(cervonic acid) 


A4,7,10,13,16,19-22:6 (DHA) 


24:0 


Lignoceric acid 
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Taking into account these definitions, the present invention is directed to novel 
DNA sequences, DNA constructs, methods and compositions are provided 
which permit modification of the poly-unsaturated long chain fatty acid content 
of plant cells. Plant cells are transformed with an expression cassette 
5 comprising a DNA encoding a polypeptide capable of increasing the amount of 
one or more PUFA in a plant cell. Desirably, integration constructs may be 
prepared which provide for integration of the expression cassette into the 
genome of a host cell. Host cells are manipulated to express a sense or 
antisense DNA encoding a polypeptide(s) that has desaturase activity. By 

1 0 "desaturase" is intended a polypeptide which can desaturate one or more fatty 
acids to produce a mono- or poly-unsaturated fatty acid or precursor thereof of 
interest. By "polypeptide" is meant any chain of amino acids, regardless of 
length or post-translational modification, for example, glycosylation or 
phosphorylation. The substrate(s) for the expressed enzyme may be produced 

15 by the host cell or may be exogenously supplied. 

To achieve expression in a host cell, the transformed DNA is operably 
associated with transcriptional and translational initiation and termination 
regulatory regions that are functional in the host cell. Constructs comprising the 
gene to be expressed can provide for integration into the genome of the host cell 

20 or can autonomously replicate in the host cell. For production of linoleic acid 
(LA), the expression cassettes generally used include a cassette which provides 
for A12 desaturase activity, particularly in a host cell which produces or can 
take up oleic acid. For production of ALA, the expression cassettes generally 
used include a cassette which provides for A15 or co3 desaturase activity, 

25 particularly in a host cell which produces or can take up LA. For production of 
GLA or SDA, the expression cassettes generally used include a cassette which 
provides for A6 desaturase activity, particularly in a host cell which produces or 
can take up LA or ALA, respectively. Production of co6-type unsaturated fatty 
acids, such as LA or GLA, is favored in a plant capable of producing ALA by 

30 inhibiting the activity of a A 1 5 or co3 type desaturase; this is accomplished by 
providing an expression cassette for an antisense A15 or a>3 transcript, or by 

-10- 
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disrupting a Al 5 or co3 desaturase gene. Similarly, production of LA or ALA is 
favored in a plant having A6 desaturase activity by providing an expression 
cassette for an antisense A6 transcript, or by disrupting a A6 desaturase gene. 
Production of oleic acid likewise is favored in a plant having A12 desaturase 
5 activity by providing an expression cassette for an antisense A12 transcript, or 
by disrupting a A12 desaturase gene. For production of ARA, the expression 
cassette generally used provides for A5 desaturase activity, particularly in a host 
cell which produces or can take up DGLA. Production of ©6-type unsaturated 
fatty acids, such as ARA, is favored in a plant capable of producing ALA by 
10 inhibiting the activity of a A15 or o>3 type desaturase; this is accomplished by 
providing an expression cassette for an antisense A15 or co3 transcript, or by 
disrupting a A15 or <o3 desaturase gene. 

TRANSGENIC PLANT PRODUCTION OF FATTY ACIDS 
Transgenic plant production of PUFAs offers several advantages over 

1 5 purification from natural sources such as fish or plants. Production of fatty 
acids from recombinant plants provides the ability to alter the naturally 
occurring plant fatty acid profile by providing new synthetic pathways in the 
host or by suppressing undesired pathways, thereby increasing levels of desired 
PUFAs, or conjugated forms thereof, and decreasing levels of undesired 

20 PUFAs. Production of fatty acids in transgenic plants also offers the advantage 
that expression of desaturase genes in particular tissues and/or plant parts means 
that greatly increased levels of desired PUFAs in those tissues and/or parts can 
be achieved, making recovery from those tissues more economical. For 
example, the desired PUFAs can be expressed in seed; methods of isolating 

25 seed oils are well established. In addition to providing a source for purification 
of desired PUFAs, seed oil components can be manipulated through expression 
of desaturase genes, either alone or in combination with other genes such as 
elongases, to provide seed oils having a particular PUFA profile in concentrated 
form. The concentrated seed oils then can be added to animal milks and/or 

30 synthetic or semi-synthetic milks to serve as infant formulas where human 
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nursing is impossible or undesired, or in cases of malnourishment or disease in 
both adults and infants. 

For production of PUFAs, depending upon the host cell, the availability 
of substrate, and the desired end product(s), several polypeptides, particularly 
5 desaturases, are of interest including those polypeptides which catalyze the 
conversion of stearic acid to oleic acid, LA to GLA, of ALA to SDA, of oleic 
acid to LA, or of LA to ALA, which includes enzymes which desaturate at the 
A6, A9, A12, A15 or co3 positions. Considerations for choosing a specific 
polypeptide having desaturase activity include the pH optimum of the 

1 0 polypeptide, whether the polypeptide is a rate limiting enzyme or a component 
thereof, whether the desaturase used is essential for synthesis of a desired poly- 
unsaturated fatty acid, and/or co-factors required by the polypeptide. The 
expressed polypeptide preferably has parameters compatible with the 
biochemical environment of its location in the host cell. For example, the 

1 5 polypeptide may have to compete for substrate with other enzymes in the host 
cell. Analyses of the K m and specific activity of the polypeptide in question 
therefore are considered in determining the suitability of a given polypeptide for 
- modifying PUFA production in a given host cell. The polypeptide used in a 
particular situation therefore is one which can function under the conditions 

20 present in the intended host cell but otherwise can be any polypeptide having 
desaturase activity which has the desired characteristic of being capable of 
modifying the relative production of a desired PUFA. A scheme for the 
synthesis of arachidonic acid (20:4 A5, 8, 11, 14) from palmitic acid (C, 6 ) is 
shown in Figure 1. A key enzyme in this pathway is a A5-desaturase which 

25 converts DH-y-linolenic acid (DGLA, eicosatrienoic acid) to ARA. Conversion 
of a-linolenic acid (ALA) to stearidonic acid by a A6-desaturase is also shown. 
Production of PUFAs in addition to ARA, including EPA and DHA is shown in 
Figure 2. A key enzyme in the synthesis of arachidonic acid (20:4 A5, 8, 1 1, 
14) from stearic acid (Cig) is a A6-desaturase which converts the linoleic acid 

30 into y-linolenic acid. Conversion of a-linolenic acid (ALA) to stearidonic acid 
by a A6-desaturase also is shown. For production of ARA, the DNA sequence 
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used encodes a polypeptide having A5 desaturase activity. In particular 
instances, this can be coupled with an expression cassette which provides for 
production of a polypeptide having A6 desaturase activity and, optionally, a 
transcription cassette providing for production of antisense sequences to a A 15 
5 transcription product. The choice of combination of cassettes used depends in 
part on the PUFA profile of the host cell. Where the host cell A5-desaturase 
activity is limiting, overexpression of A5 desaturase alone generally will be 
sufficient to provide for enhanced ARA production. 

SOURCES OF POLYPEPTIDES 
1 0 HAVING DESATURASE ACTIVITY 

As sources of polypeptides having desaturase activity and 

oligonucleotides encoding such polypeptides are organisms which produce a 

desired poly-unsaturated fatty acid. As an example, microorganisms having an 

ability to produce ARA can be used as a source of A5-desaturase genes; 

1 5 microorganisms which GLA or SDA can be used as a source of A6-desaturase 
and/or A12-desaturase genes. Such microorganisms include, for example, those 
belonging to the genera Mortierella, Conidiobolus, Pythium, Phytophathora, 
Penicillium, Porphyridium, Coidosporium, Mucor, Fusarium, Aspergillus, 
Rhodotorula, and Entomophthora. Within the genus Porphyridium, of 

20 particular interest is Porphyridium cruentum. Within the genus Mortierella, of 
particular interest are Mortierella elongata, Mortierella exigua, Mortierella 
hygrophila, Mortierella ramanniana, var. angulispora, and Mortierella alpina. 
Within the genus Mucor, of particular interest are Mucor circinelloides and 
Mucor javanicus. 

25 DNAs encoding desired desaturases can be identified in a variety of 

ways. As an example, a source of the desired desaturase, for example genomic 
or cDNA libraries from Mortierella, is screened with detectable enzymatically- 
or chemically-synthesized probes, which can be made from DNA, RNA, or non- 
naturally occurring nucleotides, or mixtures thereof. Probes may be 

30 enzymatically synthesized from DNAs of known desaturases for normal or 
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reduced-stringency hybridization methods. Oligonucleotide probes also can be 
used to screen sources and can be based on sequences of known desaturases, 
including sequences conserved among known desaturases, or on peptide 
sequences obtained from the desired purified protein. Oligonucleotide probes 
5 based on amino acid sequences can be degenerate to encompass the degeneracy 
of the genetic code, or can be biased in favor of the preferred codons of the 
source organism. Oligonucleotides also can be used as primers for PCR from 
reverse transcribed mRNA from a known or suspected source; the PCR product 
can be the full length cDNA or can be used to generate a probe to obtain the 
1 0 desired full length cDNA. Alternatively, a desired protein can be entirely 

sequenced and total synthesis of a DNA encoding that polypeptide performed. 

Once the desired genomic or cDNA has been isolated, it can be 
sequenced by known methods. It is recognized in the art that such methods are 
subject to errors, such that multiple sequencing of the same region is routine and 

1 5 is still expected to lead to measurable rates of mistakes in the resulting deduced 
sequence, particularly in regions having repeated domains, extensive secondary 
structure, or unusual base compositions, such as regions with high GC base 
content. When discrepancies arise, resequencing can be done and can employ 
special methods. Special methods can include altering sequencing conditions 

20 by using: different temperatures; different enzymes; proteins which alter the 
ability of oligonucleotides to form higher order structures; altered nucleotides 
such as ITP or methylated dGTP; different gel compositions, for example 
adding formamide; different primers or primers located at different distances 
from the problem region; or different templates such as single stranded DNAs. 

25 Sequencing of mRNA can also be employed. 

For the most part, some or all of the coding sequence for the polypeptide 
having desaturase activity is from a natural source. In some situations, 
however, it is desirable to modify all or a portion of the codons, for example, to 
enhance expression, by employing host preferred codons. Host preferred 
30 codons can be determined from the codons of highest frequency in the proteins 
expressed in the largest amount in a particular host species of interest. Thus, the 
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coding sequence for a polypeptide having desaturase activity can be 
synthesized in whole or in part. All or portions of the DNA also can be 
synthesized to remove any destabilizing sequences or regions of secondary 
structure which would be present in the transcribed mRNA. All or portions of 
5 the DNA also can be synthesized to alter the base composition to one more 
preferable in the desired host cell. Methods for synthesizing sequences and 
bringing sequences together are well established in the literature. In vitro 
mutagenesis and selection, site-directed mutagenesis, or other means can be 
employed to obtain mutations of naturally occurring desaturase genes to 
10 produce a polypeptide having desaturase activity in vivo with more desirable 
physical and kinetic parameters for function in the host cell, such as a longer 
half-life or a higher rate of production of a desired polyunsaturated fatty acid. 

Desirable cDNAs have less than 60% A+T composition, preferably less 
than 50% A+T composition. On a localized scale of a sliding window of 20 
15 base pairs, it is preferable that there are no localized regions of the cDNA with 
greater than 75% A+T composition; with a window of 60 base pairs, it is 
preferable that there are no localized regions of the cDNA with greater than 
60%, more preferably no localized regions with greater than 55% A+T 
composition. 

20 Mortierella alpina Desaturases 

Of particular interest are the Mortierella alpina A5 -desaturase, A6- 
desaturase and A12-desaturase. The A5-desaturase has 446 amino acids; the 
amino acid sequence is shown in Figure 7. The gene encoding the Mortierella 
alpina A5-desaturase can be expressed in transgenic microorganisms to effect 

25 greater synthesis of ARA from DGLA. Other DNAs which are substantially 
identical in sequence to the Mortierella alpina A5-desaturase DNA, or which 
encode polypeptides which are substantially identical in sequence to the 
Mortierella alpina A5-desaturase polypeptide, also can be used. The 
Mortierella alpina A6-desaturase, has 457 amino acids and a predicted 

30 molecular weight of 51 .8 kD; the amino acid sequence is shown in Figure 3. 
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The gene encoding the Mortierella alpina A6-desaturase can be expressed in 
transgenic plants or animals to effect greater synthesis of GLA from linoleic 
acid or of stearidonic acid (SDA) from ALA. Other DNAs which are 
substantially identical in sequence to the Mortierella alpina A6-desaturase 
5 DNA, or which encode polypeptides which are substantially identical in 

sequence to the Mortierella alpina A6-desaturase polypeptide, also can be used. 

The Mortierella alpina A12-desaturase has the amino acid sequence 
shown in Figure 5. The gene encoding the Mortierella alpina A12-desaturase 
can be expressed in transgenic plants to effect greater synthesis of LA from 
1 0 oleic acid. Other DNAs which are substantially identical to the Mortierella 
alpina A12-desaturase DNA, or which encode polypeptides which are 
substantially identical to the Mortierella alpina A12-desaturase polypeptide, 
also can be used. 

By substantially identical in sequence is intended an amino acid 

1 5 sequence or nucleic acid sequence exhibiting in order of increasing preference 
at least 60%, 80%, 90% or 95% homology to the Mortierella alpina A5- 
desaturase amino acid sequence or nucleic acid sequence encoding the amino 
acid sequence. For polypeptides, the length of comparison sequences generally 
is at least 16 amino acids, preferably at least 20 amino acids, or most preferably 

20 35 amino acids. For nucleic acids, the length of comparison sequences 

generally is at least 50 nucleotides, preferably at least 60 nucleotides, and more 
preferably at least 75 nucleotides, and most preferably, 1 10 nucleotides. 
Homology typically is measured using sequence analysis software, for example, 
the Sequence Analysis software package of the Genetics Computer Group, 

25 University of Wisconsin Biotechnology Center, 1710 University Avenue, 
Madison, Wisconsin 53705, MEGAlign (DNAStar, Inc., 1228 S. Park St., 
Madison, Wisconsin 53715), and MacVector (Oxford Molecular Group, 2105 S. 
Bascom Avenue, Suite 200, Campbell, California 95008). Such software 
matches similar sequences by assigning degrees of homology to various 

30 substitutions, deletions, and other modifications. Conservative substitutions 

typically include substitutions within the following groups: glycine and alanine; 
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valine, isoleucine and leucine; aspartic acid, glutamic acid, asparagine, and 
glutamine; serine and threonine; lysine and arginine; and phenylalanine and 
tyrosine. Substitutions may also be made on the basis of conserved 
hydrophobicity or hydrophilicity (Kyte and Doolittle, J. Mol Biol 157: 105- 
5 132, 1982), or on the basis of the ability to assume similar polypeptide 
secondary structure (Chou and Fasman, Adv. Enzymol 47: 45-148, 1978). 

Other Desaturases 
Encompassed by the present invention are related desaturases from the 
same-or other organisms. Such related desaturases include variants of the 

1 0 disclosed A5-, A6- and Al 2-desaturases that occur naturally within the same or 
different species of Mortierella, as well as homologues of the disclosed A5- 
desaturase from other species and evolutionarily related protein having 
desaturase activity. Also included are desaturases which, although not 
substantially identical to the Mortierella alpina A5 -desaturase, desaturate a fatty 

1 5 acid molecule at carbon 5, 6 or 1 2, respectively, from the carboxyl end of a fatty 
acid molecule. Related desaturases can be identified by their ability to function 
substantially the same as the disclosed desaturases; that is, are still able to 
effectively convert DGLA to ARA, LA to GLA, ALA to SDA or oleic acid to 
LA, Related desaturases also can be identified by screening sequence databases 

20 for sequences homologous to the disclosed desaturase, by hybridization of a 

probe based on the disclosed desaturase to a library constructed from the source 
organism, or by RT-PCR using mRNA from the source organism and primers 
based on the disclosed desaturase. Such desaturases includes those from 
humans, Dictyostelium discoideum and Phaeodactylum tricornum. 

25 The regions of a desaturase polypeptide important for desaturase activity 

can be determined through routine mutagenesis, expression of the resulting 
mutant polypeptides and determination of their activities. Mutants may include 
deletions, insertions and point mutations, or combinations thereof. A typical 
functional analysis begins with deletion mutagenesis to determine the N- and C- 

30 terminal limits of the protein necessary for function, and then internal deletions, 
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insertions or point mutants are made to further determine regions necessary for 
function. Other techniques such as cassette mutagenesis or total synthesis also 
can be used. Deletion mutagenesis is accomplished, for example, by using 
exonucleases to sequentially remove the 5' or 3' coding regions. Kits are 
5 available for such techniques. After deletion, the coding region is completed by 
ligating oligonucleotides containing start or stop codons to the deleted coding 
region after 5' or 3' deletion, respectively. Alternatively, oligonucleotides 
encoding start or stop codons are inserted into the coding region by a variety of 
methods including site-directed mutagenesis, mutagenic PCR or by ligation 

10 onto DNA digested at existing restriction sites. Internal deletions can similarly 
be made through a variety of methods including the use of existing restriction 
sites in the DNA, by use of mutagenic primers via site directed mutagenesis or 
mutagenic PCR. Insertions are made through methods such as linker-scanning 
mutagenesis, site-directed mutagenesis or mutagenic PCR. Point mutations are 

1 5 made through techniques such as site-directed mutagenesis or mutagenic PCR. 

Chemical mutagenesis can also be used for identifying regions of a 
desaturase polypeptide important for activity. A mutated construct is expressed, 
and the ability of the resulting altered protein to function as a desaturase is 
assayed. Such structure-function analysis can determine which regions may be 
20 deleted, which regions tolerate insertions, and which point mutations allow the 
mutant protein to function in substantially the same way as the native 
desaturase. All such mutant proteins and nucleotide sequences encoding them 
are within the scope of the present invention. 

EXPRESSION OF DESATURASE GENES 
25 Once the DNA encoding a desaturase polypeptide has been obtained, it 

is placed in a vector capable of replication in a host cell, or is propagated in 
vitro by means of techniques such as PCR or long PCR. Replicating vectors 
can include plasmids, phage, viruses, cosmids and the like. Desirable vectors 
include those useful for mutagenesis of the gene of interest or for expression of 
30 the gene of interest in host cells. The technique of long PCR has made in vitro 
propagation of large constructs possible, so that modifications to the gene of 
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interest, such as mutagenesis or addition of expression signals, and propagation 
of the resulting constructs can occur entirely in vitro without the use of a 
replicating vector or a host cell. 

For expression of a desaturase polypeptide, functional transcriptional 
5 and translational initiation and termination regions are operably linked to the 
DNA encoding the desaturase polypeptide. Transcriptional and translational 
initiation and termination regions are derived from a variety of nonexclusive 
sources, including the DNA to be expressed, genes known or suspected to be 
capable of expression in the desired system, expression vectors, chemical 

10 synthesis, or from an endogenous locus in a host cell. Expression in a plant 
tissue and/or plant part presents certain efficiencies, particularly where the 
tissue or part is one which is easily harvested, such as seed, leaves, fruits, 
flowers, roots, etc. Expression can be targeted to that location within the plant 
by using specific regulatory sequences, such as those of USPN 5,463,174, 

15 USPN 4,943,674, USPN 5,106,739, USPN 5,175,095, USPN 5,420,034, USPN 
5,188,958, and USPN 5,589,379. Alternatively, the expressed protein can be an 
enzyme which produces a product which may be incorporated, either directly or 
upon further modifications, into a fluid fraction from the host plant. In the 
present case, expression of desaturase genes, or antisense desaturase transcripts, 

20 can alter the levels of specific PUFAs, or derivatives thereof, found in plant 
parts and/or plant tissues. The A5-desaturase polypeptide coding region is 
expressed either by itself or with other genes, in order to produce tissues and/or 
plant parts containing higher proportions of desired PUFAs or in which the 
PUFA composition more closely resembles that of human breast milk (Prieto et 

25 a/., PCT publication WO 95/24494). The termination region can be derived 
from the 3' region of the gene from which the initiation region was obtained or 
from a different gene. A large number of termination regions are known to and 
have been found to be satisfactory in a variety of hosts from the same and 
different genera and species. The termination region usually is selected more as 

30 a matter of convenience rather than because of any particular property. 
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The choice of a host cell is influenced in part by the desired PUFA 
profile of the transgenic cell, and the native profile of the host cell. As an 
example, for production of linoleic acid from oleic acid, the DN A sequence 
used encodes a polypeptide having A12 desaturase activity, and for production 
5 of GLA from linoleic acid, the DNA sequence used encodes a polypeptide 
having A6 desaturase activity. Use of a host cell which expresses A12 
desaturase activity and lacks or is depleted in A15 desaturase activity, can be 
used with an expression cassette which provides for overexpression of A6 
desaturase alone generally is sufficient to provide for enhanced GLA production 

10 in the transgenic cell. Where the host cell expresses A9 desaturase activity, 
expression of both a Al 2- and a A6-desaturase can provide for enhanced GLA 
production. In particular instances where expression of A6 desaturase activity is 
coupled with expression of A12 desaturase activity, it is desirable that the host 
cell naturally have, or be mutated to have, low A15 desaturase activity. 

1 5 Alternatively, a host cell for A6 desaturase expression may have, or be mutated 
to have, high A12 desaturase activity. 

Expression in a host cell can be accomplished in a transient or stable 
fashion. Transient expression can occur from introduced constructs which 
contain expression signals functional in the host cell, but which constructs do 

20 not replicate and rarely integrate in the host cell, or where the host cell is not 
proliferating. Transient expression also can be accomplished by inducing the 
activity of a regulatable promoter operably linked to the gene of interest, 
although such inducible systems frequently exhibit a low basal level of 
expression. Stable expression can be achieved by introduction of a construct 

25 that can integrate into the host genome or that autonomously replicates in the 
host cell. Stable expression of the gene of interest can be selected for through 
the use of a selectable marker located on or transfected with the expression 
construct, followed by selection for cells expressing the marker. When stable 
expression results from integration, integration of constructs can occur 

30 randomly within the host genome or can be targeted through the use of 

constructs containing regions of homology with the host genome sufficient to 
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target recombination with the host locus. Where constructs are targeted to an 
endogenous locus, all or some of the transcriptional and translational regulatory 
regions can be provided by the endogenous locus. 

When increased expression of the desaturase polypeptide in the source 
5 plant is desired, several methods can be employed. Additional genes encoding 
the desaturase polypeptide can be introduced into the host organism. 
Expression from the native desaturase locus also can be increased through 
homologous recombination, for example by inserting a stronger promoter into 
the host genome to cause increased expression, by removing destabilizing 
1 0 sequences from either the mRNA or the encoded protein by deleting that 
information from the host genome, or by adding stabilizing sequences to the 
mRNA (see USPN 4,910,141 and USPN 5,500,365.) 

When it is desirable to express more than one different gene, appropriate 
regulatory regions and expression methods, introduced genes can be propagated 

15 in the host cell through use of replicating vectors or by integration into the host 
genome. Where two or more genes are expressed from separate replicating 
vectors, it is desirable that each vector has a different means of replication. 
Each introduced construct, whether integrated or not, should have a different 
means of selection and should lack homology to the other constructs to maintain 

20 stable expression and prevent reassortment of elements among constructs. 
Judicious choices of regulatory regions, selection means and method of 
propagation of the introduced construct can be experimentally determined so 
that all introduced genes are expressed at the necessary levels to provide for 
synthesis of the desired products. 

25 Constructs comprising the gene of interest may be introduced into a host 

cell by standard techniques. These techniques include transfection, infection, 
holistic impact, electroporation, microinjection, scraping, or any other method 
which introduces the gene of interest into the host cell (see USPN 4,743,548, 
USPN 4,795,855, USPN 5,068,193, USPN 5,188,958, USPN 5,463,174, USPN 

30 5,565,346 and USPN 5,565,347). For convenience, a host cell which has been 
manipulated by any method to take up a DNA sequence or construct will be 
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referred to as "transformed" or "recombinant" herein. The subject host will 
have at least have one copy of the expression construct and may have two or 
more, depending upon whether the gene is integrated into the genome 
amplified, or is present on an extrachromosomal element having multiple copy 
numbers. 

The transformed host cell can be identified by selection for a marker 
contamed on the introduced construct. Alternatively, a separate marker 
construct may be introduced with the desired construct, as many transformation 
techniques introduce many DNA molecules into host cells. Typically, 
transformed hosts are selected for their ability to grow on selective media 
Selective media may incorporate an antibiotic or lack a factor necessary for 
growth of the untransformed host, such as a nutrient or growth factor An 
introduced marker gene therefor may confer antibiotic resistance, or encode an 
essential growth factor or enzyme, and permit growth on selective media when 
expressed in the transformed host cell. Desirably, resistance to kanamycin and 
the ammo glycoside G418 are of interest (see USPN 5,034,322). Selection of a 
transformed host can also occur when the expressed marker protein can be 
detected, either directly or indirectly. The marker protein may be expressed 
alone or as a fusion to another protein. The marker protein can be detected by 
«s enzymatic activity; for example 0 galactosidase can convert the substrate X- 
gal to a colored product, and luciferase can convert luciferin to a light-emitting 
product. The marker protein can be detected by its light-producing or 
mod.fying characteristics; for example, the green fluorescent protein of 
Aeauorea victoria fluoresces when illuminated with blue light. Antibodies can 
be used to detect the marker protein or a molecular tag on, for example, a 
protein of interest. Cells expressing the marker protein or tag can be selected 
for example, visually, or by techniques such as FACS or panning using 
antibodies. 

The PUFAs produced using the subject methods and compositions may 
be found in the host plant tissue and/or plant part as free fatty acids or in 
conjugated forms such as acylglycerols, phospholipids, sulfolipids or 
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glycolipids, and may be extracted from the host cell through a variety of means 
well-known in the art. Such means may include extraction with organic 
solvents, sonication, supercritical fluid extraction using for example carbon 
dioxide, and physical means such as presses, or combinations thereof. Of 
5 particular interest is extraction with hexane or methanol and chloroform. Where 
desirable, the aqueous layer can be acidified to protonate negatively charged 
moieties and thereby increase partitioning of desired products into the organic 
layer. After extraction, the organic solvents can be removed by evaporation 
under a stream of nitrogen. When isolated in conjugated forms, the products are 
10 enzymatically or chemically cleaved to release the free fatty acid or a less 

complex conjugate of interest, and are then subjected to further manipulations to 
produce a desired end product. Desirably, conjugated forms of fatty acids are 
cleaved with potassium hydroxide. 

PURIFICATION OF FATTY ACIDS 

1 5 If further purification is necessary, standard methods can be employed. 

Such methods include extraction, treatment with urea, fractional crystallization, 
HPLC, fractional distillation, silica gel chromatography, high speed 
centrifugation or distillation, or combinations of these techniques. Protection of 
reactive groups, such as the acid or alkenyl groups, may be done at any step 

20 through known techniques, for example alkylation or iodination. Methods used 
include methylation of the fatty acids to produce methyl esters. Similarly, 
protecting groups may be removed at any step. Desirably, purification of 
fractions containing ARA, DHA and EPA is accomplished by treatment with 
urea and/or fractional distillation. 

25 USES OF FATTY ACIDS 

The uses of the fatty acids of subject invention are several. Probes based 
on the DNAs of the present invention may find use in methods for isolating 
related molecules or in methods to detect organisms expressing desaturases. 
When used as probes, the DNAs or oligonucleotides need to be detectable. This 

30 is usually accomplished by attaching a label either at an internal site, for 
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example via incorporation of a modified residue, or at the 5' or 3' terminus. 
Such labels can be directly detectable, can bind to a secondary molecule that is 
detectably labeled, or can bind to an unlabelled secondary molecule and a 
detectably labeled tertiary molecule; this process can be extended as long as is 
5 practical to achieve a satisfactorily detectable signal without unacceptable levels 
of background signal. Secondary, tertiary, or bridging systems can include use 
of antibodies directed against any other molecule, including labels or other 
antibodies, or can involve any molecules which bind to each other, for example 
a biotin-streptavidin/avidin system. Detectable labels typically include 

1 0 radioactive isotopes, molecules which chemically or enzymatically produce or 
alter light, enzymes which produce detectable reaction products, magnetic 
molecules, fluorescent molecules or molecules whose fluorescence or light- 
emitting characteristics change upon binding. Examples of labelling methods 
can be found in USPN 5,01 1,770. Alternatively, the binding of target molecules 

15 can be directly detected by measuring the change in heat of solution on binding 
of probe to target via isothermal titration calorimetry, or by coating the probe or 
target on a surface and detecting the change in scattering of light from the 
surface produced by binding of target or probe, respectively, as may be done 
with the BIAcore system. 

20 PUFAs of the subject invention produced by recombinant means find 

applications in a wide variety of areas. Supplementation of humans or animals 
with PUFAs in various forms can result in increased levels not only of the 
added PUFAs, but of their metabolic progeny as well. For example, where the 
inherent A6-desaturase pathway is dysfunctional in an individual, treatment with 

25 GLA can result not only in increased levels of GLA, but also of downstream 
products such as ARA and prostaglandins (see Figure 1). Complex regulatory 
mechanisms can make it desirable to combine various PUFAs, or to add 
different conjugates of PUFAs, in order to prevent, control or overcome such 
mechanisms to achieve the desired levels of specific PUFAs in an individual. 

30 PUFAs, or derivatives thereof, made by the disclosed method can be 

used as dietary supplements, particularly in infant formulas, for patients 

-24- 



BNSOOCID: <WO 9846764A1> 



WO 98/46764 PCIYUS98/07421 



15 



undergoing intravenous feeding or for preventing or treating malnutrition. 
Particular fatty acids such as EPA are used to alter the composition of infant 
formulas to better replicate the PUFA composition of human breast milk. The 
predominant triglyceride in human milk has been reported to be l,3-di-oleoyl-2- 
5 palmitoyl, with 2-palmitoyl glycerides reported as better absorbed than 2-oleoyl 
or 2-lineoyl glycerides (USPN 4,876,107). Typically, human breast milk has a 
fatty acid profile comprising from about 0. 15 % to about 0.36 % as DHA, from 
about 0.03 % to about 0.13 % as EPA, from about 0.30 % to about 0.88 % as 
ARA, from about 0.22 % to about 0.67 % as DGLA, and from about 0.27 % to 
10 about 1.04 % as GLA. A preferred ratio of GLArDGLA: ARA in infant 
formulas is from about 1 : 1 :4 to about 1:1:1, respectively. Amounts of oils 
providing these ratios of PUFA can be determined without undue 
experimentation by one of skill in the art. PUFAs, or host cells containing 
them, also can be used as animal food supplements to alter an animal's tissue or 
milk fatty acid composition to one more desirable for human or animal 
consumption. 

NUTRITIONAL COMPOSITIONS 

The present invention also includes nutritional compositions. Such 
compositions, for purposes of the present invention, include any food or 
preparation for human consumption including for enteral or parenteral 
consumption, which when taken into the body (a) serve to nourish or build up 
tissues or supply energy and/or (b) maintain, restore or support adequate 
nutritional status or metabolic function. 

The nutritional composition of the present invention comprises at least 
one oil or acid produced in accordance with the present invention and may 
either be in a solid or liquid form. Additionally, the composition may include 
edible macronutrients, vitamins and minerals in amounts desired for a particular 
use. The amount of such ingredients will vary depending on whether the 
composition is intended for use with normal, healthy infants, children or adults 
having specialized needs such as those which accompany certain metabolic 
conditions (e.g., metabolic disorders). 
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Examples of macronutrients which may be added to the composition 
include but are not limited to edible fats, carbohydrates and proteins. Examples 
of such edible fats include but are not limited to coconut oil, soy oil, and mono- 
and diglycerides. Examples of such carbohydrates include but are not limited to 
5 glucose, edible lactose and hydrolyzed search. Additionally, examples of 
proteins which may be utilized in the nutritional composition of the invention 
include but are not limited to soy proteins, electrodialysed whey , 
electrodialysed skim milk, milk whey, or the hydrolysates of these proteins. 

With respect to vitamins and minerals, the following may be added to 
10 the nutritional compositions of the present invention: calcium, phosphorus, 
potassium, sodium, chloride, magnesium, manganese, iron, copper, zinc, 
selenium, iodine, and Vitamins A, E, D, C, and the B complex. Other such 
vitamins and minerals may also be added. 

The components utilized in the nutritional compositions of the present 
15 invention will of semi-purified or purified origin. By semi-purified or purified 
is meant a material which has been prepared by purification of a natural 
material or by synthesis. 

Examples of nutritional compositions of the present invention include 
but are not limited to infant formulas, dietary supplements, and rehydration 
20 compositions. Nutritional compositions of particular interest include but are not 
limited to those utilized for enteral and parenteral supplementation for infants, 
specialist infant formulae, supplements for the elderly, and supplements for 
those with gastrointestinal difficulties and/or malabsorption. 

Nutritional Compositions 

25 A typical nutritional composition of the present invention will contain 

edible macronutrients, vitamins and minerals in amounts desired for a particular 
use. The amounts of such ingredients will vary depending on whether the 
formulation is intended for use with normal, healthy individuals temporarily 
exposed to stress, or to subjects having specialized needs due to certain chronic 

30 or acute disease states (e.g., metabolic disorders). It will be understood by 
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persons skilled in the art that the components utilized in a nutritional 
formulation of the present invention are of semi-purified or purified origin. By 
semi-purified or purified is meant a material that has been prepared by 
purification of a natural material or by synthesis. These techniques are well 
5 known in the art (See, e.g., Code of Federal Regulations for Food Ingredients 
and Food Processing; Recommended Dietary Allowances, 10 th Ed., National 
Academy Press, Washington, D.C., 1989). 

In a preferred embodiment, a nutritional formulation of the present 
invention is an enteral nutritional product, more preferably an adult or child 
10 enteral nutritional product. Accordingly in a further aspect of the invention, a 
nutritional formulation is provided that is suitable for feeding adults or children 
who are experiencing stress. The formula comprises, in addition to the PUFAs 
of the invention; macronutrients, vitamins and minerals in amounts designed to 
provide the daily nutritional requirements of adults. 

15 The macronutritional components include edible fats, carbohydrates and 

proteins. Exemplary edible fats are coconut oil, soy oil, and mono- and 
diglycerides and the PUFA oils of this invention. Exemplary carbohydrates are 
glucose, edible lactose and hydrolyzed cornstarch. A typical protein source 
would be soy protein, electrodialysed whey or electrodialysed skim milk or milk 

20 whey, or the hydrolysates of these proteins, although other protein sources are 
also available and may be used. These macronutrients would be added in the 
form of commonly accepted nutritional compounds in amount equivalent to 
those present in human milk or an energy basis, i.e., on a per calorie basis. 

Methods for formulating liquid and enteral nutritional formulas are well 
25 known in the art and are described in detail in the examples. 

The enteral formula can be sterilized and subsequently utilized on a 

ready-to-feed (RTF) basis or stored in a concentrated liquid or a powder. The 

powder can be prepared by spray drying the enteral formula prepared as 

indicated above, and the formula can be reconstituted by rehydrating the 

30 concentrate. Adult and infant nutritional formulas are well known in the art and 

commercially available (e.g., Similac®, Ensure®, Jevity® and Alimentum® 
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from Ross Products Division, Abbott Laboratories). An oil or acid of the 
present invention can be added to any of these formulas in the amounts 
described below. 

The energy density of the nutritional composition when in liquid form, 
can typically range from about 0.6 Kcal to 3 Kcal per ml. When in solid or 
powdered form, the nutritional supplement can contain from about 1.2 to more 
than 9 Kcals per gm, preferably 3 to 7 Kcals per gm. In general, the osmolality 
of a liquid product should be less than 700 mOsm and more preferably less than 
660 mOsm. 

The nutritional formula would typically include vitamins and minerals 
in addition to the PUFAs of the invention, in order to help the individual inges't 
the minimum daily requirements for these substances. In addition to the PUFAs 
listed above, it may also be desirable to supplement the nutritional composition 
with zinc, copper, and folic acid in addition to antioxidants. It is believed that 
these substances will also provide a boost to the stressed immune system and 
thus will provide further benefits to the individual. The presence of zinc, 
copper or folic acid is optional and is not required in order to gain the beneficial 
effects on immune suppression. Likewise a pharmaceutical composition can be 
supplemented with these same substances as well. 

In a more preferred embodiment, the nutritional contains, in addition to 
the antioxidant system and the PUFA component, a source of carbohydrate 
wherein at least 5 weight % of said carbohydrate is an indigestible 
oligosaccharide. In yet a more preferred embodiment, the nutritional 
composition additionally contains protein, taurine and carnitine. 

The PUFAs, or derivatives thereof, made by the disclosed method can 
be used as dietary substitutes, or supplements, particularly infant formulas, for 
patients undergoing intravenous feeding or for preventing or treating 
malnutrition. Typically, human breast milk has a fatty acid profile comprising 
from about 0.15 »/„ to about 0.36 % as DHA, from about 0.03 % to about 0 . 13 % 
as EPA, from about 0.30 % to about 0.88 % as ARA, from about 0.22 % to 
about 0.67 % as DGLA, and from about 0.27 % to about 1 .04 % as GLA 
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Additionally, the predominant triglyceride in human milk has been reported to 
be l,3-di-oleoyl-2-palmitoyl, with 2-palmitoyl glycerides reported as better 
absorbed than 2-oleoyl or 2-lineoyl glycerides (USPN 4,876,107). Thus, fatty 
acids such as ARA, DGLA, GLA and/or EPA produced by the invention can be 
5 used to alter the composition of infant formulas to better replicate the PUFA 
composition of human breast milk. In particular, an oil composition for use in a 
pharmacologic or food supplement, particularly a breast milk substitute or 
supplement, will preferably comprise one or more of ARA, DGLA and GLA. 
More preferably the oil will comprise from about 0.3 to 30% ARA, from about 
10 0.2 to 30% DGLA, and from about 0.2 to about 30% GLA. 

In addition to the concentration, the ratios of ARA, DGLA and GLA can 
be adapted for a particular given end use. When formulated as a breast milk 
supplement or substitute, an oil composition which contains two or more of 
ARA, DGLA and GLA will be provided in a ratio of about 1 : 1 9:30 to about 
6:1 :0.2, respectively. For example, the breast milk of animals can vary in ratios 
of ARA:DGLA:DGL ranging from 1 : 19:30 to 6:1 :0.2, which includes 
intermediate ratios which are preferably about 1:1:1, 1:2:1, 1:1:4. When 
produced together in a host cell, adjusting the rate and percent of conversion of 
a precursor substrate such as GLA and DGLA to ARA can be used to precisely 
control the PUFA ratios. For example, a 5% to 10% conversion rate of DGLA 
to ARA can be used to produce an ARA to DGLA ratio of about 1:19, whereas 
a conversion rate of about 75% to 80% can be used to produce an ARA to 
DGLA ratio of about 6: 1. Therefore, whether in a cell culture system or in a 
host animal, regulating the timing, extent and specificity of desaturase 
25 expression as described can be used to modulate the PUFA levels and ratios. 
Depending on the expression system used, e.g., cell culture or an animal 
expressing oil(s) in its milk, the oils also can be isolated and recombined in the 
desired concentrations and ratios. Amounts of oils providing these ratios of 
PUFA can be determined following standard protocols. PUFAs, or host cells 
30 containing them, also can be used as animal food supplements to alter an 

animal's tissue or milk fatty acid composition to one more desirable for human 
or animal consumption. 
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For dietary supplementation, the purified PUFAs, or derivatives thereof, 
may be incorporated into cooking oils, fats or margarines formulated so that in 
normal use the recipient would receive the desired amount. The PUFAs may 
also be incorporated into infant formulas, nutritional supplements or other food 
5 products, and may find use as anti-inflammatory or cholesterol lowering agents. 

Pharmaceutical Compositions 
The present invention also encompasses a pharmaceutical composition 
comprising one or more of the acids and/or resulting oils produced in 
accordance with the methods described herein. More specifically, such a 

1 0 pharmaceutical composition may comprise one or more of the acids and/or oils 
as well as a standard, well-known, non-toxic pharmaceutically acceptable 
carrier, adjuvant or vehicle such as, for example, phosphate buffered saline, 
water, ethanol, polyols, vegetable oils, a wetting agent or an emulsion such as a 
water/oil emulsion. The composition may be in either a liquid or solid form. 

1 5 For example, the composition may be in the form of a tablet, capsule, ingestible 
liquid or powder, injectible, or topical ointment or cream. 

Possible routes of administration include, for example, oral, rectal and 
parenteral. The route of administration will, of course, depend upon the desired 
effect. For example, if the composition is being utilized to treat rough, dry, or 
20 aging skin, to treat injured or burned skin, or to treat skin or hair affected by a 
disease or condition, it may perhaps be applied topically. 

The dosage of the composition to be administered to the patient may be 
determined by one of ordinary skill in the art and depends upon various factors 
such as weight of the patient, age of the patient, immune status of the patient, 
25 etc. 

With respect to form, the composition may be, for example, a solution, a 
dispersion, a suspension, an emulsion or a sterile powder which is then 
reconstituted. 
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Additionally, the composition of the present invention may be utilized 
for cosmetic purposes. It may be added to pre-existing cosmetic compositions 
such that a mixture is formed or may be used as a sole composition. 

Pharmaceutical compositions may be utilized to administer the PUFA 
5 component to an individual. Suitable pharmaceutical compositions may 

comprise physiologically acceptable sterile aqueous or non-aqueous solutions, 
dispersions, suspensions or emulsions and sterile powders for reconstitution into 
sterile solutions or dispersions for ingestion. Examples of suitable aqueous and 
non-aqueous carriers, diluents, solvents or vehicles include water, ethanol, 

1 0 polyols (propy leneglycol, polyethyleneglycol, glycerol, and the like), suitable 
mixtures thereof, vegetable oils (such as olive oil) and injectable organic esters 
such as ethyl oleate. Proper fluidity can be maintained, for example, by the 
maintenance of the required particle size in the case of dispersions and by the 
use of surfactants. It may also be desirable to include isotonic agents, for 

1 5 example sugars, sodium chloride and the like. Besides such inert diluents, the 
composition can also include adjuvants, such as wetting agents, emulsifying and 
suspending agents, sweetening, flavoring and perfuming agents. 

Suspensions, in addition to the active compounds, may contain 
suspending agents, as for example, ethoxylated isostearyl alcohols, 
20 polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose, 

aluminum metahydroxide, bentonite, agar-agar and tragacanth or mixtures of 
these substances, and the like. 

Solid dosage forms such as tablets and capsules can be prepared using 
techniques well known in the art. For example, PUFAs of the invention can be 

25 tableted with conventional tablet bases such as lactose, sucrose, and cornstarch 
in combination with binders such as acacia, cornstarch or gelatin, disintegrating 
agents such as potato starch or alginic acid and a lubricant such as stearic acid 
or magnesium stearate. Capsules can be prepared by incorporating these 
excipients into a gelatin capsule along with the antioxidants and the PUFA 

30 component. The amount of the antioxidants and PUFA component that should 
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be incorporated into the pharmaceutical formulation should fit within the 
guidelines discussed above. 

As used in this application, the term "treat" refers to either preventing, or 
reducing the incidence of, the undesired occurrence. For example, to treat 
5 immune suppression refers to either preventing the occurrence of this 

suppression or reducing the amount of such suppression. The terms "patient" 
and "individual" are being used interchangeably and both refer to an animal. 
The term "animal" as used in this application refers to any warm-blooded 
mammal including, but not limited to, dogs, humans, monkeys, and apes. As 
1 0 used in the application the term "about" refers to an amount varying from the 
stated range or number by a reasonable amount depending upon the context of 
use. Any numerical number or range specified in the specification should be 
considered to be modified by the term about. 

"Dose" and "serving" are used interchangeably and refer to the amount 
15 of the nutritional or pharmaceutical composition ingested by the patient in a 
single setting and designed to deliver effective amounts of the antioxidants and 
the structured triglyceride. As will be readily apparent to those skilled in the 
art, a single dose or serving of the liquid nutritional powder should supply the 
amount of antioxidants and PUFAs discussed above. The amount of the dose or 
20 serving should be a volume that a typical adult can consume in one sitting. This 
amount can vary widely depending upon the age, weight, sex or medical 
condition of the patient. However as a general guideline, a single serving or 
dose of a liquid nutritional produce should be considered as encompassing a 
volume from 100 to 600 ml, more preferably from 125 to 500 ml and most 
25 preferably from 125 to 300 ml. 

The PUFAs of the present invention may also be added to food even 
when supplementation of the diet is not required. For example, the composition 
may be added to food of any type including but not limited to margarines, 
modified butters, cheeses, milk, yogurt, chocolate, candy, snacks, salad oils, 
30 cooking oils, cooking fats, meats, fish and beverages. 
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Pharmaceutical Applications 

For pharmaceutical use (human or veterinary), the compositions are 
generally administered orally but can be administered by any route by which 
they may be successfully absorbed, e.g., parenterally (i.e. subcutaneously, 
5 intramuscularly or intravenously), rectally or vaginally or topically, for 

example, as a skin ointment or lotion. The PUFAs of the present invention may 
be administered alone or in combination with a pharmaceutically acceptable 
carrier or excipient. Where available, gelatin capsules are the preferred form of 
oral administration. Dietary supplementation as set forth above also can 

1 0 provide an oral route of administration. The unsaturated acids of the present 
invention may be administered in conjugated forms, or as salts, esters, amides 
or prodrugs of the fatty acids. Any pharmaceutically acceptable salt is 
encompassed by the present invention; especially preferred are the sodium, 
potassium or lithium salts. Also encompassed are the N-alkylpolyhydrdxamine 

1 5 salts, such as N-methyl glucamine, found in PCT publication WO 96/33 1 55. 
The preferred esters are the ethyl esters. As solid salts, the PUFAs also can be 
administered in tablet form. For intravenous administration, the PUFAs or 
derivatives thereof may be incorporated into commercial formulations such as 
Intralipids. The typical normal adult plasma fatty acid profile comprises 6.64 to 

20 9.46% of ARA, 1 .45 to 3. 1 1 % of DGLA, and 0.02 to 0.08% of GLA. These 
PUFAs or their metabolic precursors can be administered, either alone or in 
mixtures with other PUFAs, to achieve a normal fatty acid profile in a patient. 
Where desired, the individual components of formulations may be individually 
provided in kit form, for single or multiple use. A typical dosage of a particular 

25 fatty acid is from 0. 1 mg to 20 g, or even 1 00 g daily, and is preferably from 1 0 
mg to 1, 2, 5 or 10 g daily as required, or molar equivalent amounts of 
derivative forms thereof. Parenteral nutrition compositions comprising from 
about 2 to about 30 weight percent fatty acids calculated as triglycerides are 
encompassed by the present invention; preferred is a composition having from 

30 about 1 to about 25 weight percent of the total PUFA composition as GLA 

(USPN 5,196,198). Other vitamins, and particularly fat-soluble vitamins such 

as vitamin A, D, E and L-carnitine can optionally be included. Where desired a 
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preservative such as a tocopherol may be added, typically at about 0.1% by 
weight. 

Suitable pharmaceutical compositions may comprise physiologically 
acceptable sterile aqueous or non-aqueous solutions, dispersions, suspensions or 
5 emulsions and sterile powders for reconstitution into sterile injectible solutions 
or dispersions. Examples of suitable aqueous and non-aqeuous carriers, 
diluents, solvents or vehicles include water, ethanol, polyols (propylleneglyol, 
polyethylenegycol, glycerol, and the like), suitable mixtures thereof, vegetable 
oils (such as olive oil) and injectable organic esters such as ehyl oleate. Proper 

1 0 fluidity can be maintained, for example, by the maintenance of the required 

particle size in the case of dispersions and by the use of surfactants. It may also 
be desirable to include isotonic agents, for example sugars, sodium chloride and 
the like. Besides such inert diluents, the composition can also include 
adjuvants, such as wetting agents, emulsifying and suspending agents, 

1 5 sweetening, flavoring and perfuming agents. 

Suspensions in addition to the active compounds, may contain 
suspending agents, as for example, ethoxylated isostearyl alcohols, 
polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose, 
aluminum metahydroxide, bentonite, agar-agar and tragacanth, or mixtures of 
20 these substances and the like. 

An especially preferred pharmaceutical composition contains 
diacetyltartaric acid esters of mono- and diglycerides dissolved in an aqueous 
medium or solvent. Diacetyltartaric acid esters of mono- and diglycerides have 
an HLB value of about 9-12 and are significantly more hydrophilic than existing 

25 antimicrobial lipids that have HLB values of 2-4. Those existing hydrophobic 
lipids cannot be formulated into aqueous compositions. As disclosed herein, 
those lipids can now be solubilized into aqueous media in combination with 
diacetyltartaric acid esters of mono-and diglycerides. In accordance with this 
embodiment, diacetyltartaric acid esters of mono- and diglycerides (e.g., 

30 DATEM-C12:0) is melted with other active antimicrobial lipids (e.g., 18:2 and 
12:0 monoglycerides) and mixed to obtain a homogeneous mixture. 
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Homogeneity allows for increased antimicrobial activity. The mixture can be 
completely dispersed in water. This is not possible without the addition of 
diacetyltartaric acid esters of mono- and diglycerides and premixing with other 
monoglycerides prior to introduction into water. The aqueous composition can 
5 then be admixed under sterile conditions with physiologically acceptable 

diluents, preservatives, buffers or propellants as may be required to form a spray 
or inhalant. 

The present invention also encompasses the treatment of numerous 
disorders with fatty acids. Supplementation with PUFAs of the present 

10 invention can be used to treat restenosis after angioplasty. Symptoms of 

inflammation, rheumatoid arthritis, and asthma and psoriasis can be treated with 
the PUFAs of the present invention. Evidence indicates that PUFAs may be 
involved in calcium metabolism, suggesting that PUFAs of the present 
invention may be used in the treatment or prevention of osteoporosis and of 

15 kidney or urinary tract stones. 

The PUFAs of the present invention can be used in the treatment of 
cancer. Malignant cells have been shown to have altered fatty acid 
compositions; addition of fatty acids has been shown to slow their growth and 
cause cell death, and to increase their susceptibility to chemotherapeutic agents. 

20 GLA has been shown to cause reexpression on cancer cells of the E-cadherin 
cellular adhesion molecules, loss of which is associated with aggressive 
metastasis. Clinical testing of intravenous administration of the water soluble 
lithium salt of GLA to pancreatic cancer patients produced statistically 
significant increases in their survival. PUFA supplementation may also be 

25 useful for treating cachexia associated with cancer. 

The PUFAs of the present invention can also be used to treat diabetes 
(USPN 4,826,877; Horrobin et aL, Am. J. Clin. Nutr. Vol. 57 (Suppl.), 732S- 
737S). Altered fatty acid metabolism and composition has been demonstrated 
in diabetic animals. These alterations have been suggested to be involved in 
30 some of the long-term complications resulting from diabetes, including 
retinopathy, neuropathy, nephropathy and reproductive system damage. 
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Primrose oil, which contains GLA, has been shown to prevent and reverse 
diabetic nerve damage. 

The PUFAs of the present invention can be used to treat eczema, reduce 
blood pressure and improve math scores. Essential fatty acid deficiency has 
5 been suggested as being involved in eczema, and studies have shown beneficial 
effects on eczema from treatment with GLA. GLA has also been shown to 
reduce increases in blood pressure associated with stress, and to improve 
performance on arithmetic tests. GLA and DGLA have been shown to inhibit 
platelet aggregation, cause vasodilation, lower cholesterol levels and inhibit 

1 0 proliferation of vessel wall smooth muscle and fibrous tissue (Brenner et al. , 
Adv. Exp. Med. Biol. Vol. 83, p. 85-101, 1976). Administration of GLA or 
DGLA, alone or in combination with EPA, has been shown to reduce or prevent 
gastro-intestinal bleeding and other side effects caused by non-steroidal anti- 
inflammatory drugs (USPN 4,666,701). GLA and DGLA have also been shown 

1 5 to prevent or treat endometriosis and premenstrual syndrome (USPN 4,758,592) 
and to treat myalgic encephalomyelitis and chronic fatigue after viral infections 
(USPN 5,1 16,871). 

Further uses of the PUFAs of this invention include use in treatment of 
AIDS, multiple schlerosis, acute respiratory syndrome, hypertension and 
20 inflammatory skin disorders. The PUFAs of the inventions also can be used for 
formulas for general health as well as for geriatric treatments. 

Veterinary Applications 

It should be noted that the above-described pharmaceutical and 
nutritional compositions may be utilized in connection with animals, as well as 
25 humans, as animals experience many of the same needs and conditions as 

human. For example, the oil or acids of the present invention may be utilized in 
animal feed supplements or as animal feed substitutes. 

The following examples are presented by way of illustration, not of 
limitation. 
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Examples 

Example 1 Isolation of A5 Desaturase Nucleotide Sequence from 
Mortierella alpina 

Example 2 Isolation of A6 Desaturase Nucleotide Sequence from 
Mortierella alpina 

Example 3 Identification of A6 Desaturases Homologues to the 
Mortierella alpina A Desaturase 

Example 4 Isolation of D- 1 2 Desaturase Nucleotide Sequence from 
Mortierella alpina 

Example 5 Isolation of Cytochrome b5 Reductase Nucleotide 
Sequence from Mortierella alpina 

Example 6 Expression of M alpina Desaturase Clones in Baker's 
Yeast 

Example 7 Fatty Acid Analysis of Leaves from Ma29 Transgenic 
Brassica Plants 

Example 8 Expression of M. alpina A6 Desaturase in Brassica 
napus 

Example 9 Expression of M. alpina A 1 2 desaturase in Brassica 
napus 

Example 10 Simultaneous expression of M. alpina A6 and A 12 
desaturases in Brassica napus 

Example 1 1 Simultaneous expression of M. alpina A5 and A6 
desaturases in Brassica napus 

Example 12 Simultaneous expression of M alpina A5, A6 and A12 
desaturases in Brassica napus 

Example 1 3 Stereospecific Distribution of A6-Desaturated Oils 

Example 14 Fatty Acid Compositions of Transgenic Plants 
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Example 15 



Combined Expression of A6 and A 12 Desaturases in B. 
napus Achieved by Crossing 



Example 1 6 



Expression of M alpina desaturases in soybean 



Example 17 



Human Desaturase Gene Sequences 



5 



Example 1 



Isolation of a A5-dcsaturase Nucleotide Sequence from Mortierella alpina 
Motierella alpina produces arachidonic acid (ARA, 20:4) from the 
precursor 20:3 by a A5 -desaturase. A nucleotide sequence encoding the A5- 
desaturase from Mortierella alpina (see Figure 7) was obtained through PCR 
10 amplification using M alpina 1 st strand cDNA and degenerate oligonucleotide 
primers corresponding to amino acid sequences conserved between A6- 
desaturases from Synechocystis and Spirulina. The procedure used was as 
follows: 



1 5 Mortierella alpina using the protocol of Hoge et al (1982) Experimental 

Mycology 6:225-232. The RNA was used to prepare double-stranded cDNA 
using BRL's lambda-ZipLox system, following the manufacturer's instructions. 
Several size fractions of the M. alpina cDNA were packaged separately to yield 
libraries with different average-sized inserts. The "full-length" library contains 

20 approximately 3 x 10 6 clones with an average insert size of 1 .77 kb. The 
"sequencing-grade" library contains approximately 6 x 10 5 clones with an 
average insert size of 1 . 1 kb. 

5^g of total RNA was reverse transcribed using BRL Superscript RTase 
and the primer TSyn 5'-CAAGCTTCTGCAGGAGCTCTTTTTTTTTTTTTTT- 
25 3' (SEQ ID NO: 1 9.) Degenerate oligonucleotides were designed to regions 
conserved between the two cyanobacterial A6-desaturase sequences. The 
specific primers used were: 



Total RNA was isolated from a 3 day old PUFA-producing culture of 
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D6DESAT-F3 (SEQ ID NO:20) 

5'-CUACUACUACUACAYCAYACOTAYACOAAYAT-3' 
D6DESAT-R3 ( SEQ ID NO:21) 

5 ' -C AUC AUC AUCAUOGGRAAO ARRTGRTG-3 ' 

5 where Y=C+T, R=A+G, and OI+C. PCR amplification was carried out in a 
25jal volume containing: template derived from 40 ng total RNA, 2 pM each 
primer, 200 |iM each deoxyribonucleotide triphosphate, 60 mM Tris-Cl, pH 8.5, 
15 mM (NHLO2SO4, 2 mM MgCl 2 . Samples were subjected to an initial 
desaturation step of 95 degrees (all temperatures Celsius) for 5 minutes, then 

10 held at 72 degrees while 0.2 U of Taq polymerase were added. PCR 

thermocycling conditions were as follows: 94 degrees for 1 min., 45 degrees 
for 1.5 min., 72 degrees for 2 min. PCR was continued for 35 cycles. PCR 
using these primers on the M alpina first-strand cDNA produced a 550 bp 
reaction product. Comparison of the deduced amino acid sequence of the M 

1 5 alpina PCR fragment revealed regions of homology with A6-desaturases (see 
Figure 4). However, there was only about 28% identity over the region 
compared. The deduced amino acid sequence is presented in SEQ ID NO: 14. 

The PCR product was used as a probe to isolate corresponding cDNA 
clones from a M alpina library. The longest cDNA clone, Ma29, was 

20 designated pCGN5521 and has been completely sequenced on both strands. 
The cDNA is contained as a 1481 bp insert in the vector pZLl (Bethesda 
Research Laboratories) and, beginning with the first ATG, contains an open 
reading frame encoding 446 amino acids. The reading frame contains the 
sequence deduced from the PCR fragment. The sequence of the cDNA insert 

25 was found to contain regions of homology to A6-desaturases (see Figure 8). For 
example, three conserved "histidine boxes" (that have been observed in other 
membrane-bound desaturases (Okuley et al. 9 (1994) The Plant Cell 5:147-158)) 
were found to be present in the Mortierella sequence at amino acid positions 
171-175, 207-212, and 387-391 (see Figure 5A-5D). However, the typical 

30 "HXXHH" amino acid motif for the third histidine box for the Mortierella 
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desaturase was found to be QXXHH. The amino-terminus of the encoded 
protein, showed significant homology to cytochrome b5 proteins. Thus, the 
Mortierella cDNA clone appears to represent a fusion between a cytochrome b5 
and a fatty acid desaturase. Since cytochrome b5 is believed to function as the 
5 electron donor for membrane-bound desaturase enzymes, it is possible that the 
N-terminal cytochrome b5 domain of this desaturase protein is involved in its 
function. This may be advantageous when expressing the desaturase in 
heterologous systems for PUFA production. 

Example 2 

10 Isolation of A6 Desaturase Nucle otide Sequence from Mortierella alpina 
A nucleic acid sequence from a partial cDNA clone, Ma524, encoding a 
A6 fatty acid desaturase from Mortierella alpina was obtained by random 
sequencing of clones from the M. alpina cDNA library described in Example 1. 
cDNA-containing plasmids were excised as follows: 

1 5 Five u.1 of phage were combined with 1 00 ul of E. coli DH 1 OB(ZIP) 

grown in ECLB plus 10 ng/ml kanamycin, 0.2% maltose, and 10 mM MgS0 4 
and incubated at 37 degrees for 15 minutes. 0.9 ml SOC was added and 100 ul 
of the bacteria immediately plated on each of 10 ECLB + 50 u.g Pen plates. No 
45 minute recovery time was needed. The plates were incubated overnight at 37 

20 degrees. Colonies were picked into ECLB + 50 ng Pen media for overnight 
cultures to be used for making glycerol stocks and miniprep DNA. An aliquot 
of the culture used for the miniprep is stored as a glycerol stock. Plating on 
ECLB + 50 fig Pen/ml resulted in more colonies and a greater proportion of 
colonies containing inserts than plating on 100 jig/ml Pen. 

25 Random colonies were picked and plasmid DNA purified using Qiagen 

miniprep kits. DNA sequence was obtained from the 5' end of the cDNA insert 
and compared to the databases using the BLAST algorithm. Ma524 was 
identified as a putative A6 desaturase based on DNA sequence homology to 
previously identified A6 desaturases. A full-length cDNA clone was isolated 
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from the M. alpina library. The abundance of this clone appears to be slightly 
(2X) less than Ma29. Ma524 displays significant homology to a portion of a 
Caenorhabditis elegans cosmid, W06D2.4, a cytochrome b5/desaturase fusion 
protein from sunflower, and the two A6 desaturases in the public databanks 
5 those from Synechocystis and Spirulina. 

In addition, Ma524 shows significant homology to the borage A6- 
desaturase sequence (PCT publication WO 96/21022). Ma524 thus appears to 
encode a A6-desaturase that is related to the borage and algal A6-desaturases. It 
should be noted that, although the amino acid sequences of Ma524 and the 

10 borage A6 are similar, the base composition of the cDNAs is quite different: the 
borage cDNA has an overall base composition of 60 % A+T, with some regions 
exceeding 70 %, while Ma524 has an average of 44 % A+T base composition, 
with no regions exceeding 60 %. This may have implications for expressing the 
cDNAs in microorganisms or animals which favor different base compositions. 

15 It is known that poor expression of recombinant genes can occur when the host 
has a very different base composition from that of the introduced gene. 
Speculated mechanisms for such poor expression include decreased stability or 
translatability of the mRNA. 



Example 3 

20 Identification of A6-desaturases Homologous 

to the Mortierella alpina A6-desaturase 

Nucleic acid sequences that encode putative A6-desaturases were 

identified through a BLASTX search of the est databases through NCBI using 

the Ma524 amino acid sequence. Several sequences showed significant 

25 homology. In particular, the deduced amino acid sequence of two Arabidopsis 
thaliana sequences, (accession numbers F 13728 and T42806) showed 
homology to two different regions of the deduced amino acid sequence of 
Ma524. The following PCR primers were designed: ATTS4723-FOR 
(complementary to F 13728) S'-CUACUACUACUAGGAGTCCTCTA 

30 CGGTGTTTTG, SEQ ID NO:22, and T42806-REV (complementary to 
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T42806) 5' CAUCAUCAUCAUATGATGCTCAAGCTGAAACTG, SEQ ID 
NO:23. Five ug of total RNA isolated from developing siliques of Arabidopsis 
thaliana was reverse transcribed using BRL Superscript RTase and the primer 
TSyn S'-CCAAGCTTCTGCAGGAGCTC' 1 11111111 1 nTlT-3', (SEQ ID 
5 NO:24). PCR was carried out in a 50 ul volume containing: template derived 
from 25 ng total RNA, 2 pM each primer, 200 uM each deoxyribonucleotide 
triphosphate, 60 mM Tris-Cl, pH 8.5, 15 mM (NH^SC^, 2 mM MgCl 2 , 0.2 U 
Taq Polymerase. Cycle conditions were as follows: 94 degrees for 30 sec, 50 
degrees for 30 sec, 72 degrees for 30 sec. PCR was continued for 35 cycles 

1 0 followed by an additional extension at 72 degrees for 7 minutes. PCR resulted 
in a fragment of -750 base pairs which was subsequently subcloned, named 12- 
5, and sequenced. Each end of this fragment corresponds to the Arabidopsis 
est from which the PCR primers were derived. This is the sequence named 12-5. 
The deduced amino acid sequence of 12-5 is compared to that of Ma524 and 

1 5 ests from human (W28 1 40), mouse (W53753), and C. elegans (R052 1 9) in 
Figure 4. Based on homology, these sequences represent desaturase 
polypeptides. The full-length genes can be cloned using probes based on the est 
sequences. The genes can then be placed in expression vectors and expressed in 
host cells and their specific A6- or other desaturase activity can be determined 

20 as described below. 



Example 4 

Isolation of A-12 Desaturase Nucle o tide Sequence from Mortierella alnina 
Based on the fatty acids it accumulates, Mortierella alpina has an co6 
type desaturase. The to6 desaturase is responsible for the production of linoleic 
acid (18:2) from oleic acid (18:1). Linoleic acid (18:2) is a substrate for a A6 
desaturase. This experiment was designed to determine if Mortierella alpina 
has a A12-desaturase polypeptide, and if so, to identify the corresponding 
nucleotide sequence. A random colony from the M. alpina sequencing grade 
library, Ma648, was sequenced and identified as a putative desaturase based on 
DNA sequence homology to previously identified desaturases, as described for 
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Ma524 (see Example 2). The deduced amino acid sequence from the 5* end of 
the Ma648 cDNA displays significant homology to soybean microsomal a>6 
(A 12) desaturase (accession #L43921) as well as castor bean oleate 12- 
hydroxylase (accession #U22378). In addition, homology is observed to a 
5 variety of other co6 (A 12) and co3 (Al 5) fatty acid desaturase sequences. 

Example 5 

Isolation of Cyt ochrome b5 Reductase Nucleotide Sequence 
from Mortierella alnitta 

A nucleic acid sequence encoding a cytochrome b5 reductase from 

1 0 Mortierella alpina was obtained as follows. A cDNA library was constructed 

based on total RNA isolated from Mortierella alpina as described in Example 1 . 

DNA sequence was obtained from the 5* and 3' ends of one of the clones, Ml 2- 

27. A search of public databanks with the deduced amino acid sequence of the 

3* end of Ml 2-27 (see Figure 5) revealed significant homology to known 

1 5 cytochrome b5 reductase sequences. Specifically, over a 49 amino acid region, 

the Mortierella clone shares 55% identity (73% homology) with a cytochrome 

b5 reductase from pig (see Figure 4). 

Example 6 

Expression of M alpina Desaturase Clones in Baker's Yeast 
20 Yeast Transformation 

Lithium acetate transformation of yeast was performed according to 
standard protocols (Methods in Enzymology, Vol. 194, p. 186-187, 1991). 
Briefly, yeast were grown in YPD at 30°C. Cells were spun down, resuspended 
in TE, spun down again, resuspended in TE containing 100 mM lithium acetate, 
25 spun down again, and resuspended in TE/lithium acetate. The resuspended 
yeast were incubated at 30°C for 60 minutes with shaking. Carrier DNA was 
added, and the yeast were aliquoted into tubes. Transforming DNA was added, 
and the tubes were incubated for 30 min. at 30°C. PEG solution (35% (w/v) 
PEG 4000, 100 mM lithium acetate, TE pH7.5) was added followed by a 50 
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min. incubation at 30°C. A 5 min. heat shock at 42°C was performed, the cells 
were pelleted, washed with TE, pelleted again and resuspended in TE. The 
resuspended cells were then plated on selective media. 

Pesaturase E xpression in Transformed Yeast 

5 cDNA clones from Mortierella alpina were screened for desaturase 

activity in baker's yeast. A canola A 1 5-desaturase (obtained by PCR using 1 st 
strand cDNA from Brassica napus cultivar 212/86 seeds using primers based on 
the published sequence (Arondel et al. Science 258:1353-1355)) was used as a 
positive control. The Al 5-desaturase gene and the gene from cDNA clone 

1 0 Ma29 was put in the expression vector p YES2 (Invitrogen), resulting in 

plasmids pCGR-2 and pCGR-4, respectively. These plasmids were transfected 
into S. cerevisiae yeast strain 334 and expressed after induction with galactose 
and in the presence of substrates that allowed detection of specific desaturase 
activity. The control strain was S. cerevisiae strain 334 containing the unaltered 

1 5 pYES2 vector. The substrates used, the products produced and the indicated 
desaturase activity were: DGLA (conversion to ARA would indicate A5- 
desaturase activity), linoleic acid (conversion to GLA would indicate A6- 
desaturase activity; conversion to ALA would indicate A 1 5-desaturase activity), 
oleic acid (an endogenous substrate made by S. cerevisiae, conversion to 

20 linoleic acid would indicate A12-desaturase activity, which S. cerevisiae lacks), 
or ARA (conversion to EPA would indicate Al 7-desaturase activity). The 
results are provided in Table 1 below. The lipid fractions were extracted as 
follows: Cultures were grown for 48-52 hours at 1 5°C. Cells were pelleted by 
centrifugation, washed once with sterile ddH 2 0, and repelleted. Pellets were 

25 vortexed with methanol; chloroform was added along with tritridecanoin (as an 
internal standard). The mixtures were incubated for at least one hour at room 
temperature or at 4°C overnight. The chloroform layer was extracted and 
filtered through a Whatman filter with one gram of anhydrous sodium sulfate to 
remove particulates and residual water. The organic solvents were evaporated 

30 at 40°C under a stream of nitrogen. The extracted lipids were then derivatized 

to fatty acid methyl esters (FAME) for gas chromatography analysis (GC) by 
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adding 2 ml of 0.5 N potassium hydroxide in methanol to a closed tube. The 
samples were heated to 95°C to 100°C for 30 minutes and cooled to room 
temperature. Approximately 2 ml of 14 % boron trifluoride in methanol was 
added and the heating repeated. After the extracted lipid mixture cooled, 2 ml 
5 of water and 1 ml of hexane were added to extract the FAME for analysis by 
GC. The percent conversion was calculated by dividing the product produced 
by the sum of (the product produced and the substrate added) and then 
multiplying by 100. To calculate the oleic acid percent conversion, as no 
substrate was added, the total linoleic acid produced was divided by the sum of 
10 (oleic acid and linoleic acid produced), then multiplying by 100. 
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Table 1 

M alvina Desaturase Expression in Baker's Yeast 

CLONE TYPE OF ENZYME % CONVERSION 

ACTIVITY OF SUBSTRATE 

pCGR-2 A6 0 (18:2 to 18:30)6) 

(canolaA15 A15 16.3 (18:2 to 18:3cd3) 

desaturase) A 5 2.0 (20:3 to 20:4o>6) 

A17 2.8 (20:4 to 20:5©3) 

A12 1.8 (18:1 to 18:2©6) 

pCGR-4 A6 0 

(M. alpina A15 0 

A6-Iike, Ma29) A5 15.3 

A17 0.3 

A12 3.3 

pCGR-7 A6 0 

(M. alpina A15 3.8 

A12-like, Ma648 A5 2.2 

A17 0 

A12 63.4 



The A15-desaturase control clone exhibited 16.3% conversion of the 
5 substrate. The pCGR-4 clone expressing the Ma29 cDNA converted 15.3% of 
the 20:3 substrate to 20:4w6, indicating that the gene encodes a A5-desaturase. 
The background (non-specific conversion of substrate) was between 0-3% in 
these cases. The pCGR-5 clone expressing the Ma524 cDNA showed 6% 
conversion of the substrate to GLA, indicating that the gene encodes a A6- 
10 desaturase. The pCGR-7 clone expressing the Ma648 cDNA converted 63.4% 
conversion of the substrate to LA, indicating that the gene encodes a A12- 
desaturase. Substrate inhibition of activity was observed by using different 
concentrations of the substrate. When substrate was added to 100 nM, the 
percent conversion to product dropped as compared to when substrate was 
1 5 added to 25 \xM (see below). These data show that desaturases with different 
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substrate specificities can be expressed in a heterologous system and used to 
produce PUFAs, 

Table 2 represents fatty acids of interest as a percent of the total lipid 
extracted from the yeast host S. cerevisiae 334 with the indicated plasmid. No 
5 glucose was present in the growth media. Affinity gas chromatography was 
used to separate the respective lipids. GC/MS was employed to verify the 
identity of the product(s). The expected product for the B. napus A15- 
desaturase, ct-linolenic acid, was detected when its substrate, linoleic acid, was 
added exogenously to the induced yeast culture. This finding demonstrates that 

1 0 yeast expression of a desaturase gene can produce functional enzyme and 
detectable amounts of product under the current growth conditions. Both 
exogenously added substrates were taken up by yeast, although slightly less of 
the longer chain PUFA, dihomo-y-linolenic acid (20:3), was incorporated into 
yeast than linoleic acid (18:2) when either was added in free form to the induced 

1 5 yeast cultures, y-linolenic acid was detected when linoleic acid was present 
during induction and expression of S. cerevisiae 334 (pCGR-5). The presence 
of this PUFA demonstrates A6-desaturase activity from pCGR-5 (MAS 24). 
Linoleic acid, identified in the extracted lipids from expression of S. cerevisiae 
334 (pCGR-7), classifies the cDNA MA648 from M alpina as the A 12- 

20 desaturase. 
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Example 7 

Expression of AS Desaturase in Plants 
Expression in Leaves 

This experiment was designed to determine whether leaves expressing 
5 Ma29 (as determined by Northern) were able to convert exogenously applied 
DGLA (20:3) to ARA (20:4). 

The Ma29 desaturase cDNA was modified by PCR to introduce 
convenient restriction sites for cloning. The desaturase coding region has been 
inserted into a d35 cassette under the control of the double 35S promoter for 
10 expression in Brassica leaves (pCGN5525) following standard protocols (see 
USPN 5,424,200 and USPN 5,106,739). Transgenic Brassica plants containing 
pCGN5525 were generated following standard protocols (see USPN 5,188,958 
and USPN 5,463,174). 

In the first experiment, three plants were used: a control, LP004-1, and 
1 5 two transgenics,, 5525-23 and 5525-29. LP004 is a low-linolenic Brassica 

variety. Leaves of each were selected for one of three treatments: water, GLA 
or DGLA. GLA and DGLA were purchased as sodium salts from NuChek Prep 
and dissolved in water at 1 mg/ml. Aliquots were capped under N 2 and stored at 
-70 degrees C. Leaves were treated by applying a 50 yd drop to the upper 
20 surface and gently spreading with a gloved finger to cover the entire surface. 
Applications were made approximately 30 minutes before the end of the light 
cycle to minimize any photo-oxidation of the applied fatty acids. After 6 days 
of treatment one leaf from each treatment was harvested and cut in half through 
the mid rib. One half was washed with water to attempt to remove 
25 unincorporated fatty acid. Leaf samples were lyophilized overnight, and fatty 
acid composition determined by gas chromatography (GC). The results are 
shown in Table 3. 
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Leaves treated with GLA contained from 1 .56 to 2.4 wt% GLA. The fatty acid 
analysis showed that the lipid composition of control and transgenic leaves was 
essentially the same. Leaves of control plants treated with DGLA contained 
1.2-1.9 w% DGLA and background amounts of ARA (.26-.27 wt%). 
5 Transgenic leaves contained only .2-.7 wt% DGLA, but levels of ARA were 
increased (.74-1 . 1 wt%) indicating that the DGLA was converted to ARA in 
these leaves. 

Expression in Seed 

The purpose of this experiment was to determine whether a construct 
1 0 with the seed specific napin promoter would enable expression in seed. 

The Ma29 cDNA was modified by PCR to introduce ATioI cloning sites 
upstream and downstream of the start and stop codons, respectively, using the 
following primers: 

Madxho-forward: 

1 5 5'-CUACU ACUACUACTCGAGC AAGATGGGAACGG ACC AAGG 

(SEQ ID NO:25) 

Madxho-reverse : 

S'-CAUCAUCAUCAUCTCGAGCTACTCTTCCTTGGGACGGAG 
(SEQ ID NO:26). 

20 The PCR product was subcloned into pAMP 1 (GIBCOBRL) using the 

CloneAmp system (GIBCOBRL) to create pCGN5522 and the A5 desaturase 
sequence was verified by sequencing of both strands. 

For seed-specific expression, the Ma29 coding region was cut out of 
pCGN5522 as anXhol fragment and inserted into the Sail site of the napin 
25 expression cassette, pCGN3223, to create pCGN5528. The HindlU fragment of 
pCGN5528 containing the napin 5' regulatory region, the Ma29 coding region, 
and the napin 3' regulatory region was inserted into the Hindlll site of 
pCGN1557 to create pCGN5531. Two copies of the napin transcriptional unit 
were inserted in tandem. This tandem construct can permit higher expression of 

-52- 



BNSDOC1D: <WO 9846764A1> 



WO 98/46764 



PCTYUS98/07421 



the desaturases per genetic loci. pCGN5531 was introduced into Brassica 
napus cv.LP004 via Agrobacterium mediated transformation. 

The fatty acid composition of twenty-seed pools of mature T2 seeds was 
analyzed by GC. Table 4 shows the results obtained with independent 
5 transformed lines as compared to non-transformed LP004 seed. The transgenic 
seeds containing pCGN553 1 contain two fatty acids that are not present in the 
control seeds, tentatively identified as taxoleic acid (5,9-18:2) and pinolenic 
acid (5,9,12-18:3), based on their elution relative to oleic and linoleic acid. 
These would be the expected products of A5 desaturation of oleic and linoleic 
1 0 acids. No other differences in fatty acid composition were observed in the 
transgenic seeds. 
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Northern analysis is performed on plants to identify those expressing 
Ma29. Developing embryos are isolated approximately 25 days post anthesis or 
when the napin promoter is induced, and floated in a solution containing GLA 
or DGLA as described in Example 7. Fatty acid analysis of the embryos is then 
5 performed by GC to determine the amount of conversion of DGLA to ARA, 

following the protocol adapted for leaves in Example 7. The amount of ARA 
incorporated into triglycerides by endogenous Brassica acyltransferases is then 
evaluated by GC analysis as in Example 7. 

Example 8 

*0 Expressi on of M. alvina A6 Desaturasc in Brassica nap us 

The Ma524 cDNA was modified by PCR to introduce cloning sites 
using the following primers: 

Ma524PCR-l (SEQ ID NO:27) 

1 5 5-CUACUACUACUATCTAGACTCGAGACCATGGCTGCTGCT 
CCAGTGTG 

Ma524PCR-2 (SEQ ID NO:28) 

5'-CAUCAUCAUCAUAGGCCTCGAGTTACTGCGCCTTACCCAT 

20 These primers allowed the amplification of the entire coding region and 

added Xbal and Xhol sites to the 5'-end and Xhol and Stul sites to the 3' end. 
The PCR product was subcloned into pAMPl (GIBCOBRL) using the 
CloneAmp system (GIBCOBRL) to create pCGN5535 and the A6 desaturase 
sequence was verified by sequencing of both strands. 

25 For seed-specific expression, the Ma524 coding region was cut out of 

pCGN5535 as an Xhol fragment and inserted into the Sail site of the napin 
expression cassette, pCGN3223, to create pCGN5536. The Notl fragment of 
pCGN5536 containing the napin 5' regulatory region, the Ma524 coding region, 
and the napin 3' regulatory region was inserted into the Noil site of pCGN1557 
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to create pCGN5538. pCGN5538 was introduced into Brassica napus 
cv.LP004 via Agrobacterium mediated transformation. 

Maturing T2 seeds were collected from 6 independent transformation 
events in the greenhouse. The fatty acid composition of single seeds was 
5 analyzed by GC Table 5 shows the results of control LP004 seeds and six 5538 

lines. All of the 5538 lines except #8 produced seeds containing GLA. 
Presence of GLA segregated in these seeds as is expected for the T2 selfed seed 
population. In addition to GLA, the M alpina A6 desaturase is capable of 
producing 18:4 (stearidonic) and another fatty acid believed to be the 6,9-18:2. 

1 0 The above results show that desaturases with three different substrate 

specificities can be expressed in a heterologous system and used to produce 
poly-unsaturated long chain fatty acids. Exemplified were the production of 
ARA (20:4) from the precursor 20:3 (DGLA), the production of GLA (18:3) 
from 18:2 substrate, and the conversion of 18:1 substrate to 18:2, which is the 

1 5 precursor for GLA. 
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Example 9 

Expression o f M. alpina A 12 desaturase in Brassica napus 
The Ma648 cDNA was modified by PCR to introduce cloning sites 
using the following primers: 

5 Ma648PCR-for (SEQ ID NO:29) 

5'-CUACUACUACUAGGATCCATGGCACCTCCCAACACT 
Ma648PCR-rev (SEQ ID NO:30) 

5*-CAUCAUCAUCAUGGTACCTCGAGTTACTTCTTGAAAAAGAC 

These primers allowed the amplification of the entire coding region and 
0 added a BamHI site to the 5' end and Kpnl and Xhol sites to the 3* end. The 

PCR product was subcloned into pAMPl (GIBCOBRL) using the CloneAmp 
system (GIBCOBRL) to create pCGN5540 and the A12 desaturase sequence 
was verified by sequencing of both strands. 

For seed-specific expression, the Ma648 coding region was cut out of 
5 pCGN5540 as a BamHI/XhoI fragment and inserted between the Bglll and 

Xhol sites of the napin expression cassette, pCGN3223, to create pCGN5542. 
The Asp71 8 fragment of pCGN5541 containing the napin 5' regulatory region, 
the Ma648 coding region, and the napin 3' regulatory region was inserted into 
the Asp71 8 site of pCGN5 138 to create pCGN5542. PCGN5542 was 
0 introduced into two varieties of Brassica napus via Agrobacterium mediated 

transformation. The commercial canola variety, SP30021, and a low-linolenic 
line, LP30108 were used. 

Mature selfed T2 seeds were collected from 19 independent LP30108 
transformation events and a non-transformed control grown in the greenhouse. 
5 These seeds are expected to be segregating for the A12 desaturase transgene. 

The fatty acid composition of20-seed pools was analyzed by GC. The results 
are shown in Table 6. All transformed lines contained increased levels of 18:2, 
the product of the A12 desaturase. Levels of 18:3 were not significantly 
increased in these plants. Events # 1 1 and 16 showed the greatest accumulation 
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of 18:2 in the pooled seeds. To investigate the segregation of 18:2 levels in the 
T2 seeds and to identify individual plants to be taken on to subsequent 
generations, half-seed analysis was done. Seeds were germinated overnight in 
the dark at 30 degrees on water-soaked filter paper. The outer cotyledon was 
excised for GC analysis and the rest of the seedling was planted in soil. Results 
of some of these analyses are shown in Table 7. Individual T2 seeds containing 
the M alpina A 12 desaturase accumulated up to 60% 18:2 in the seeds. Sample 
97xxl 1 1 6 #59 is an example of a null segregant. Even in the highest 1 8:2 
accumulators, levels of 18:3 were increased only slightly. These and other 
individually selected T2 plants were grown in the greenhouse and in the field to 
produce T3 seed. 

Mature selfed T2 seeds were collected from 20 independent SP30021 
transformation events and a non-transformed control grown in the greenhouse. 
These seeds are expected to be segregating for the A 12 desaturase transgene. 
The fatty acid composition of 20-seed pools was analyzed by GC. The data are 
presented in Table 8. All transformed lines contained increased levels of 1 8:2, 
the product of the A 12 desaturase. As in the low-linolenic LP30108 line, levels 
of 1 8:3 were not significantly increased. Events # 4 and 12 showed the greatest 
accumulation of 18:2 in the pooled seeds. To investigate the segregation of 
1 8:2 levels in the T2 seeds and to identify individual plants to be taken on to 
subsequent generations, alf-seed analysis was done. Seeds were germinated 
overnight in the dark at 30 degrees on water-soaked filter paper. The outer 
cotyledon was excised for GC analysis and the rest of the seedling was planted 
in soil. Results of some of these analyses are shown in Table 9. Samples 
97xxl 157 #88 and #18 are examples of null segregants for 5542-SP30021-4 and 
5542-SP30021-12 respectively. These and other individually selected T2 plants 
were grown in the greenhouse and in the field to produce T3 seed 
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Example 10 

Simulta neous expression ofM. alpina A6 and A12 
desaturases in Brassica nanus 

5 In order to express the M. alpina A6 and A 1 2 desaturases from the same 

T-DNA, the following construct for seed-specific expression was made. 

The NotI fragment of pCGN5536 containing the containing the napin 5' 
regulatory region, the Ma524 coding region, and the napin 3' regulatory region 
was inserted into the NotI site of pCGN5542 to create pCGN5544. The 
1 0 expression modules were oriented in such a way that the direction of 

transcription from Ma524 and Ma648 and the nptll marker is the same. 

PCGN5544 was introduced into Brassica napus cv.LP30108 via 
Agrobacterium mediated transformation. Mature selfed T2 seeds were collected 
from 16 independent LP30108 transformation events and a non-transformed 
1 5 control that were grown in the greenhouse. These seeds are expected to be 

segregating for the A6+ A12 desaturase transgene. The fatty acid composition 
of 20-seed pools was analyzed by GC. The results are presented in Table 10. 
All but one of the lines (5544-LP30 108-3) shows an altered oil composition as 
compared to the controls. GLA was produced in all but three of the lines (-3, -4, 
20 -1 1); two of the three without GLA ( -4, -11) showed increased 18:2 indicative 

of expression of the A12 desaturase. As a group, the levels of GLA observed in 
plants containing the double A6 + A12 construct (pCGN5544) were higher than 
those of plants containing pCGN5538 (A6 alone). In addition, levels of the A 6,9 
1 8 :2 are much reduced in the plants containing the A 1 2 + A6 as compared to A6 
25 alone. Thus, the combination of A6 and A12 desaturases on one T-DNA leads 

to the accumulation of more GLA and fewer side products than expression of 
A6 desaturase alone. To investigate the segregation of GLA levels in the T2 
seeds and to identify individual plants to be taken on to subsequent generations, 
half-seed analysis was done. Seeds were germinated overnight in the dark at 30 
degrees on water-soaked filter paper. The outer cotyledon was excised for GC 
analysis and the rest of the seedling was planted in soil. Results of some of 
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these analyses are shown in Table 1 1. As expected for the T2 population, levels 
of GLA and 18:2 are segregating in the individual seeds. GLA content of up to 
60% of total fatty acids was observed in individual seeds. Individual events 
were selected to be grown in the greenhouse and field for production of T3 
5 seed. 

Transgenic plants including Brassica, soybean, safflower, corn flax and 
sunflower expressing the constructs of this invention can be a good source of 
GLA. 

Typical sources of GLA such as borage produce at most 25% GLA. In 
0 contrast the plants in Table 10 contain up to 30% GLA. Furthermore, the 

individual seeds shown in Table 1 1 contain up to 60% GLA. 
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Example 11 

Simultaneous expression of M. alpina AS and A6 
desaturases in Brassica napus 

In order to produce arachadonic acid (ARA) in transgenic canola oil 

both A5 and A6 desaturase activities need to be introduced. In order to facilitate 

downstream characterization and breeding, it may be advantageous to have both 

activities encoded by a single T-DNA. The following example illustrates the 

simultaneous expression of A5 and A6 desaturases. 

The Asp718 fragment of pCGN5528 containing the napin 5' regulatory 
region, the Ma29 coding region, and the napin 3' regulatory region was inserted 
into the Asp718 site of pCGN5138 to create pCGN5545. The NotI fragment of 
pCGN5536 containing the napin 5' regulatory region, the Ma524 coding region, 
and the napin 3' regulatory region was inserted into the NotI site of pCGN5545 
to create pCGN5546. The expression modules were oriented in such a way that 
the direction of transcription from Ma524 and Ma29 and the nptll marker is the 
same. 

PCGN5546 was introduced into Brassica napus cv.LP30108 via 
Agrobacterium mediated transformation. Mature selfed T2 seeds were collected 
from 30 independent LP30108 transformation events that were grown in the 
greenhouse. The fatty acid composition of 20-seed pools was analyzed by GC. 
The results are shown in Table 12. All the lines show expression of both 
desaturases as evidenced by the presence of A 5,9 18:2 (as seen in pCGN5531 
plants) and A 6,9 1 8:2 and GLA (as seen in pCGN5538 plants) 
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Example 12 

Simultaneous expression of M. alpina AS. A6 and A12 
desaturases in Brassica nanus 

5 In order to achieve optimal production of ARA in transgenic canola oil 

both the A6 and A12 desaturase activities may need to be present in addition to 

the A5 activity. In order to facilitate downstream characterization and breeding, 

it may be advantageous to have all of these activities encoded by a single T- 

DNA. The following example illustrates the simultaneous expression of A5, A6 

1 0 and A 1 2 desaturases. 

The HindlH fragment of pCGN5528 containing the napin 5' regulatory 
region, the Ma29 coding region, and the napin 3' regulatory region was inserted 
into the HindlH site of pCGN5544 to create pCGN5547. The expression 
modules were oriented in such a way that the direction of transcription from 
1 5 Ma29, Ma524, Ma648 and the nptll marker is the same. 

PCGN5547 was introduced into Brassica napus cv.LP30108 via 
Agrobacterium mediated transformation. Mature selfed T2 seeds were collected 
from 30 independent LP30108 transformation events that were grown in the 
greenhouse. The fatty acid composition of 20-seed pools was analyzed by GC. 

20 The results are shown in Table 13. Twenty-seven of the lines show significant 

accumulation of GLA and in general the levels of GLA observed are higher 
than those seen in the 5546 plants that did not contain the A 12 desaturase. The 
A12 desaturase appears to be active in most lines as evidenced by the lack of 
detectable A6,9 18:2 and elevated 18:2 levels in most plants. Small amounts of 

25 A5,9 1 8:2 are seen in the 5547 plants, although the levels are generally less than 

those observed in the 5546 plants. This may be due to the presence of the A12 
desaturase which efficiently converts the 18:1 to 18:2 before it can be 
desaturated at the A5 position. 
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Example 13 

Stereospecific Distribution of A6-Desaturated Oils 

This experiment was designed to investigate the stereospecific 
distribution of the A6-desaturated oils in seeds expressing pCGN5538 (Ma 524 
5 cDNA). Three seed samples were used: 

1) Non-transformed B. napus cv. LP004 seeds (control) 

2) Segregating T2 seeds of pCGN5538-LP004-19 

3) Segregating T2 seeds of pCGN5538 LP004-29 
The following protocol was used for the analysis: 

10 1. Seed Oil Extraction 

Fifty seeds were placed in a 12 x 32 mm vial and crushed with a glass 
rod. 1 .25 mL hexane was added and the mixture was vortexed. The seeds were 
extracted overnight on a shaker. The extract was then filtered through a 0.2 
micron filter attached to a lcc syringe. The extract was then dried down under 
1 5 nitrogen. The resulting oil was used for digestion and derivatization of the 

whole oil sample. 

2. Digestion 

A. Liquid Oil Digestion 

The stock lipase (from Rhizopus arrhizus, Sigma, L4384) was diluted to 
20 approximately 600,000 units/mL with a goal of obtaining 50% digestion of the 

TAG. The stock lipase is maintained at 4 degrees C and placed on ice. The 
amount of reagents may be adjusted according to the amount of oil to be 
digested. 

The following amounts are based on a 2.0 mg extracted oil sample. In a 
25 12 x 32 mm screw cap vial the following were added: 2.0 mg oil, 200 0.1 M 

tris HC1 pH 7, 40 [iL 2.2 w/v% CaCl 2 2H 2 0, and 100 \lL 0.05 w/v % bile salts. 
The material was vortexed and sonicated to disperse the oil. Twenty \iL of 
diluted lipase was added and the mixture was vortexed continuously for 1 .0 
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minute at room temperature. A white precipitate formed. The reaction was 
stopped with 100 uL 6M HC1 and vortexing. Five hundred uL CHCl 3 :CH 3 OH 
(2:1) was added and the mixture was vortexed and held on ice while reaining 
digestions were carried out. Samples were vortexed again and centrifuged 
5 briefly to sharpen layers. The lower layer containing digest products was 

removed with a pasteur pipette and placed in a 12 x 32 mm crimp cap vial. The 
material was then re-extracted with 300 uL CHC1 3 , vortexed, centrifuged, and 
combined with the lower layers. The digest products were kept on ice as much 
as possible. HPLC separation is performed as soon as possible after digestion to 
10 minimize acyl migration. 

B. Solid Fat Digestion 

The procedure for liquid oil digestion described above was followed 
except that 20 \il 1 1 :0 methyl ester is added to 2.0 mg solid fat. 

3. HPLC Separation 

1 5 The digestion products were dried down in chloroform to approximately 

200 |iL. Each sample was then transferred into an insert in an 8 x 40 mm shell 
vial and 30 \xL was injected for HPLC analysis. 

The high performance liquid chromatographic system was equipped 
with a Varex ELSD IIA evaporative light scattering detector with tube 
20 temperature at 105°C and nitrogen gas flow at 40 mL/min; a Waters 712 Wisp 

autosampler, three Beckman 1 14M Solvent Delivery Modules; a Beckman 
421 A controller, a Rheodyne pneumatically actuated stream splitter; and a 
Gilson micro fractionator. The chromatography column is a 220 x 4.6 mm, 5 
micron normal phase silica cartridge by Brownlee. 

25 The three solvents used were: 

A= hexanertoluene 1 : 1 

B= toluene: ethyl acetate 3:1 

C= 5% formic acid in ethyl acetate 
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The gradient profile was as follows: 



Time (min) 


Function 


Value 


Duration 


Oflow 


2.0 mL/min 






0%B 


10 






0%C 


2 






2%C 


25 




6 min 


14.0 % C 


2 




1 min 


15.0 


End program 







A chromatographic standard mixture is prepared in hexanertoluene 1:1 
containing the following: 

0.2 mg/mL triglyceride 16:0 



5 2.0 mg/mL 16:0 Free Fatty Acid 

0.2 mg/mL di 16:0 mixed isomers (1,2-diacylglycerol and 1,3-diacyl glycerol) 
0.2 mg/mL 3-mono acyl glycerol 16:0 
0.2 mg/mL 2-mono acylglycerol 16:0 

For each sample, the fraction containing the 2-mag peak is collected 
10 automatically by method controlled timed events relays. A time delay is used to 

synchronize the detector with the collector's emitter. The 2-mag peaks are 
collected and the fractions are evaporated at room temperature overnight. 

The sn-2 composition results rely on minimization of acyl migration. 
Appearance of 1-monoacylglycerol and/or 3-monoacylglycerol peaks in the 
15 chromatograph means that acyl migration has occurred. 

4. Derivatization 

To derivatize the whole oil, 1 .0 mg of the extracted whole oil was 
weighed into a 12 x 32 mm crimp cap vial. One mL toluene was then added. 
The sample is then vortexed and a 50 \xL aliquot was removed for 
20 derivatization. To the dried down 2-mag samples, 50 |aL toluene was added. To 

both the whole oil and 2-mag fractions 105 uL H2SO4/CH3OH @ 8.76 wt% is 
added. The cap was tightly capped and the sample is refluxed for 1 hour at 95 
degrees C. The sample was allowed to cool and 500 uL 10 w/v % NaCl in 
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water and 60 uL heptane was added. The organic layer was removed and 
inserted in a 12 x 32 mm crimp cap vial. 

5. GLC Analysis 

A Hewlett Packard model 6890 GC equipped with a split/splitless 
capillary inlet, FID detector, 6890 series autosampler and 3392A Alpha Omega 
integrator is set up for the capillary column as follows: 



Supelco Omegawax 250, 30 m length, 0.25 mm id, 0.25 um film 
thickness 



injection port: 


260 C 


detector: 


270 C 


initial temp: 


170 C 


initial time: 


1.5 min 


rate: 


30 deg/min 


final temp: 


245 C 


final time: 


6.5 min 


injection vol: 


1.5 uL 


head pressure: 


25 psi 


split ratio: 


30 


carrier gas: 


He 


make-up gas: 


N 2 


FID gas: 


H + air 



Percent compositions of fatty acid methyl esters are calculated as mole 
percents. For carbon chain lengths less than 12, the use of theoretical or 
empirical response factors in the area percent calculation is desirable. 
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6. Calculations 

The mean distribution of each acyl group at each sn-l and sn-3 position 
was calculated, 

mean sn-l and sn-3 composition = (3 WO comp - MAG comp) / 2 
5 WO = whole oil 

MAG= monoacylglycerol 

The results of this analysis are presented in Table 14. The GLA and A 6,9 
1 8:2 are evenly distributed between the sn-2 and sn-l, 3 positions. This 
analysis can not discriminate between fatty acids in the sn-l vs. sn-3 positions. 
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Example 14 

Fatty Acid Compositions of Transgenic Plants 

A5 and A6 transgenic plants were analyzed for their fatty acid content. 

The following protocol was used for oil extraction: 

5 1 . About 400 mg of seed were weighed out in duplicate for each 

sample. 

2. The seeds were crushed in a motar and pestle. The mortar and 
pestle was rinsed twice with 3ml (2:1) (v:v) 
CHCl 3 :CH 3 OH/MeOH. An additional 6 ml (2:1) was added to 

10 the 20ml glass vial (oil extracted in 12ml total 2:1). 

3. Samples were vortexed and placed on an orbital shaker for 2 
hours with occasional vortexing. 

4. SmloflMNaCl was added to each sample. Sample was 
vortexed then spun in centrifuge at 2000rpm for 5 minutes. 

1 5 Lower phase was drawn off using a pasteur pipette. 

5. Upper phase was re-extracted with an additional 5ml. Sample 
was vortexed then spun in centrifuge at 2000 rpm for 5 minutes. 
The lower phase was drawn off using a pasteur pipette and added 
to previous lower phase. 

20 6. CHC1 3 :CH30H /MeOH was evaporated under nitrogen using 

evaporative cooling. Vial containing extracted oil was sealed 
under nitrogen. Between 120mg- 160mg oil was extracted for 
each sample. 

For GC-MS analysis, fatty acid methyl esters were dissolved in an 
25 appropriate volume of hexane and analyzed using a Hewlett-Packard 5890 

Series II Plus gas chromatograph (Hewlett Packard, Palo Alto, CA) equipped 
with a 30 m x 0.32 mm i.d. Omegawax 320 fused sillica capillary column 
(Supelco, Bellefonte, PA) and a Hewlett-Packard 5972 Series mass selective 
detector. Mass spectra were intrepreted by comparison to the mass spectra in 
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NIST/EPA/NIH Chemical Structure Database using a MS Chem Station 
(#G1036A) (Hewlett Packard). 

Transgenic line 5531-6 was analyzed in duplicate (A, B) and compared 
to control line LP004-6. The fatty acid profile results are shown in Table 15. 

Transgenic line 5538-19 was analyzed in duplicate (A, B) and compared 
to control line LP004-6. The fatty acid profile results are shown in Table 16. 
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Table 15 
Fatty Acid Profile 
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TRANSGFNIC 
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001(0103 d 


001 mini a 
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C13:0 










C14:0 




0.053 




0 061 


C14:l 










C15:0 isomer 










C15:0 










C16:0 


4.107 


4.034 


4.257 


4 224 


€16:1 


0.181 


0.173 


0.200 


0.199 


C16:2 


0.061 


0.065 


0.081 


0.060 


C17:0 










C16:3 


0.244 


0.246 


0.155 


0.151 


C16:4 










C18:0 


2.608 


2.714 


3.368 


3.417 


C18:lw9 


65.489 


66.454 


59.529 


59.073 


C18:lw7 


2.297 


2.185 


2.388 


2.393 


CI8:2 5,9 
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Table IS 
Fattv Acid Profile 







CD IN 1 KUL 


TRANSG EN IC 


TRANSGENIC 
















L*rUU4-Ot> 


553 1-6 A 


553I-6B 


















L.KL-Z042 


LRL-2045 








uuifuioi.a 


001f0104.d 


C20*4w6 










C20*3w3 










C20:4w3 










C20*5w3 














U.33o 


0.463 


0.467 






A A'JQ 

U.UJo 




















U.UJ4 
















C23:0 




0.029 






C22:4w6 










C22:5w6 










C22:5w3 










C24:0 


0.373 


0.391 


0.280 


0.283 


C22:6w3 


0.314 


0.317 


0.223 


0.212 


C24:lw9 




















TOTAL 


100.00 


100.00 


100.00 


100.00 
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Table 16 
Fatty Acid Pr file 
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Table 16 
Fattv Acid Pr file 
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TOTAL 


100.00 


100.00 


100.00 


100.00 
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Example 15 

Combined Expression of A6 and A12 
Desaturases in B. napus Achieved by Crossing 

Plants containing either the A6 or the A 12 desaturase were crossed and 

5 individual Fl half-seeds were analyzed for fatty acid composition by GC. Data 

from one such cross are given in Table 1 7. The parents for the cross were 

5538-LP004-25-2-25 (A6 expressor) and 5542-SP30021-10-16 (A12 expressor). 

Reciprocal crosses were made and the results of 25 individual Fl seeds of each 

are shown in the table. Crosses are described such that the first parent indicated 

10 is the female. Both sets of crosses gave approximately the same results. 

Compared to the parents, the A 6,9 1 8:2 decreased, and the GLA increased. A 9,12 
18:2 levels are increased in most of the FTs as well. Note that these are Fl 
seeds and only contain one set of each desaturase. In future generations and 
selection of events homo2ygous for each desaturase, the F2 GLA levels 

1 5 obtained may be even higher. 

Combining traits by crossing may be preferable to combining traits on 
. one T-DN A in some situations. Particularly if both genes are driven off of the 
same promoter (in this case napin), issues of promoter silencing may favor this 
approach over putting nultiple cDNAs on one construct. 

20 Alternatively, in some cases, combining multiple cDNAs on one T-DNA 

may be the method of choice. The results are shown in Table 17. 
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Example 16 

Expression of M. alnina desaturases in soybean 
The M. alpina desaturases can be used to drive production of GLA and 
other PUFAs in soybean by use of the following expression constructs. Two 
5 means by which exogenous DNA can be inserted into the soybean genome are 

Agrobacterium infection or particle gun. Particle gun transformation is 
disclosed in U.S. patent 5,503,998. Plants can be selected using a glyphosate 
resistance marker (4, 971 , 908). Agrobacterium transformation of soybean is 
well established to one of ordinary skill in the art. 

1 0 For seed specific expression, the coding regions of the desaturase 

cDNAs are placed under control of the 5* regulatory region of Glycine max 
alpha-type beta conglycinin storage protein gene. The specific region that can 
be used is nucleotides 78-921 of gi 169928 (Doyle, J. J., Schuler, MA., " 
Godette, W.D., Zenger, V., Beachy, R.N., and Slightom. J.L., 1986 J. Biol. 

15 Chem. 261 (20), 9228-9238). The 3* regulatory region that can be used is from 

the pea ribulose 1,5 bisphosphate carboxylase/oxygenase small subunit (rbcS) 
- gene. The specific sequences to be used are nucleotides 1-645 of gi 169145 
(Hunt, A.G. 1988 DNA 7: 329-336). 

Since soybean seeds contain more 18:2, and perhaps more endogenous 
20 A 12 desaturase activity than Brassica, the effect of the Mortierella A 12 

desaturase on achieving optimal GLA levels can be tested as follows. A 
construct containing the A6 cDNA can be used to see if A 6,9 18:2 is produced 
along with GLA. A construct containing the A12 desaturase can be used to see 
if the amount of 1 8:2 can be increased in soybean. A construct containing both 
25 the A6 and A12 desaturases can be used to produce optimal levels of GLA. 

Alternatively, plants containing each of the single desaturases may be crossed if 
necessary to combine the genes. 

Similar constructs may be made to express the A5 desaturase alone, or in 
combination with A 12 and/or A6 desaturases. 
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Example 17 

Human Desaturase Gene Sequences 
Human desaturase gene sequences potentially involved in long chain 
polyunsaturated fatty acid biosynthesis were isolated based on homology 
5 between the human cDNA sequences and Mortxerella alpina desaturase gene 

sequences. The three conserved "histidine boxes" known to be conserved 
among membrane-bound desaturases were found. As with some other 
membrane-bound desaturases the final HXXHH histidine box motif was found 
to be QXXHH. The amino acid sequence of the putative human desaturases 
1 0 exhibited homology to M. alpina A5, A6, A9, and A 1 2 desaturases. 

The M. alpina A5 desaturase and A6 desaturase cDNA sequences were 
used to search the LifeSeq database of Incyte Pharmaceuticals, Inc., Palo Alto, 
California 94304. The A5 desaturase sequence was divided into fragments; 1) 
amino acid no. 1-150, 2) amino acid no. 151-300, and 3) amino acid no. 301- 

15 446. The A6 desaturase sequence was divided into three fragments; 1) amino 

acid no. 1-150, 2) amino acid no. 151-300, and 3) amino acid no. 301-457. 
These polypeptide fragments were searched against the database using the 
"tblastn" algorithm. This alogarithm compares a protein query sequence against 
a nucleotide sequence database dynamically translated in all six reading frames 

20 (both strands). 

The polypeptide fragments 2 and 3 of M. alpina A5 and A6 have 
homologies with the ClonelD sequences as outlined in Table 18. The ClonelD 
represents an individual sequence from the Incyte LifeSeq database. After the 
"tblastn" results have been reviewed, Clone Information was searched with the 

25 default settings of Stringency of >=50, and Productscore <=1 00 for different 

ClonelD numbers. The Clone Information Results displayed the information 
including the ClusterlD, ClonelD, Library, HitID, Hit Description. When 
selected, the ClusterlD number displayed the clone information of all the clones 
that belong in that ClusterlD. The Assemble command assembles all of the 

30 ClonelD which comprise the ClusterlD. The following default settings were 
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used for GCG (Genetics Computer Group, University of Wisconsin 
Biotechnology Center, Madison, Wisconsin 53705) Assembly: 

Word Size: 7 

5 Minimum Overlap : 1 4 

Stringency: 0.8 

Minimum Identity : 1 4 

Maximum Gap: 10 

Gap Weight: 8 

10 Length Weight: 2 

GCG Assembly Results displayed the contigs generated on the basis of 
sequence information within the ClonelD. A contig is an alignment of DNA 
sequences based on areas of homology among these sequences. A new 

15 sequence (consensus sequence) was generated based on the aligned DNA 

sequences within a contig. The contig containing the ClonelD was identified, 
and the ambiguous sites of the consensus sequence was edited based on the 
alignment of the ClonelDs (see SEQ ID NO:3 1 - SEQ ID NO:35) to generate 
the best possible sequence. The procedure was repeated for all six ClonelD 

20 listed in Table 18. This produced five unique contigs. The edited consensus 

sequences of the 5 contigs were imported into the Sequencher software program 
(Gene Codes Corporation, Ann Arbor, Michigan 48 105). These consensus 
sequences were assembled. The contig 251 1785 overlaps with contig 3506132, 
and this new contig was called 2535 (SEQ ID NO:37). The contigs from the 

25 Sequencher program were copied into the Sequence Analysis software package 

of GCG. 

Each contig was translated in all six reading frames into protein 
sequences. The M. alpina A5 (MA29) and A6 (MA524) sequences were 
compared with each of the translated contigs using the FastA search (a Pearson 
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A 



and Lipman search for similarity between a query sequence and a group of 
sequences of the same type (nucleic acid or protein)). Homology among these 
sequences suggest the open reading frames of each contig. The homology 
among the M alpina A5 and A6 to contigs 2535 and 3854933 were utilized to 
create the final contig called 253538a. Figure 9 is the FastA match of the final 
contig 253538a and MA29, and Figure 10 is the FastA match of the final contig 
253538a and MA524. The DNA sequences for the various contigs are 
presented in SEQ ID NO:31 -SEQ ID NO:37 The various peptide sequences 
are shown in SEQ ID NO:38 - SEQ ID NO: 44. 

Although the open reading frame was generated by merging the two 
contigs, the contig 2535 shows that there is a unique sequence in the beginning 
of this contig which does not match with the contig 3854933. Therefore, it is 
possible that these contigs were generated from independent desaturase like 
human genes. 

The contig 253538a contains an open reading frame encoding 432 
amino acids. It starts with Gin (CAG) and ends with the stop codon (TGA). 
The contig 253538a aligns with both M alpina A5 and A6 sequences, 
suggesting that it could be either of the desaturases, as well as other known 
desaturases which share homology with each other. The individual contigs 
listed in Table 18, as well as the intermediate contig 2535 and the final contig 
253538a can be utilized to isolate the complete genes for human desaturases. 

Uses of the Human Desaturases 

These human sequences can be expressed in yeast and plants utilizing 
the procedures described in the preceding examples. For expression in 
mammalian cells and transgenic animals, these genes may provide superior 
codon bias. In addition, these sequences can be used to isolate related 
desaturase genes from other organisms. 
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Table 18 



Sections of the 
Desatu rases 


Clone ID from LifeSeq Database 


Keyword 


151-300 A5 


3808675 


fatty acid desaturase 


301-446 A5 


354535 


A6 


151-300 A6 


3448789 


A6 


151-300 A6 


1362863 


A6 


151-300 A6 


2394760 


A6 


301-457 A6 


3350263 


A6 



Example 18 

Identification of Homologues to M alpina AS and A6 desaturases 

A nucleic acid sequence that encodes a putative A5 desaturase was 
identified through a TBLASTN search of the expressed sequence tag databases 
through NCBI using amino acids 100-446 of Ma29 as a query. The truncated 
portion of the Ma29 sequence was used to avoid picking up homologies based 
on the cytochrome b5 portion at the N-terminus of the desaturase. The deduced 
amino acid sequence of an est from Dictyostelium discoideum (accession # 
- C25549) shows very significant homology to Ma29 and lesser, but still 
significant homology to Ma524. The DNA sequence is presented as SEQ ID 
NO:45. The amino acid sequence is presented as SEQ ID NO:46. 

Example 19 

Identification of M alpina AS and A6 homologues in other 
PUFA-producing organisms 

To look for desaturases involved in PUFA production, a cDNA library 

was constructed from total RNA isolated from Phaeodactylum tricornutum. A 

plasmid-based cDNA library was constructed in pSPORTl (GIBCO-BRL) 

following manufacturer's instructions using a commercially available kit 

(GIBCO-BRL). Random cDNA clones were sequenced and nucleic acid 

sequences that encode putative A5 or A6 desaturases were identified through 

BLAST search of the databases and comparison to Ma29 and Ma524 sequences. 
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One clone was identified from the Phaeodactylum library with 
homology to Ma29 and Ma524; it is called 144-01 1-B 12. The DNA sequence is 
presented as SEQ ID NO:47. The amino acid sequence is presented as SEQ ID 
NO:48. 

5 Example 20 

Identification of Af, giving A5 and A6 homologues in other 
PUFA -producing organisms 

To look for desaturases involved in PUFA production, a cDNA library 

was constructed from total RNA isolated from Schizochytrium species. A 

10 plasmid-based cDNA library was constructed in pSPORTl (GBBCO-BRL) 

following manufacturer's instructions using a commercially available kit 

(GIBCO-BRL). Random cDNA clones were sequenced and nucleic acid 

sequences that encode putative A5 or A6 desaturases were identified through 

BLAST search of the databases and comparison to Ma29 and Ma524 sequences. 

15 One clone was identified from the Schizochytrium library with 

homology to Ma29 and Ma524; it is called 81 -23-C7. This clone contains a -1 
kb insert. Partial sequence was obtained from each end of the clone using the 
universal forward and reverse sequencing primers. The DNA sequence from 
the forward primer is presented as SEQ ID NO:49. The peptide sequence is 

20 presented as SEQ ID NO:50. The DNA sequence from the reverse primer is 

presented as SEQ ID NO: 5 1 . The amino acid sequence from the reverse primer 
is presented as SEQ ID NO:52. 

Example 21 

Nutritional Compositions 

25 The PUFAs of the previous examples can be utilized in various 

nutritional supplements, infant formulations, nutritional substitutes and other 
nutrition solutions. 

L INFANT FORMULATIONS 

A. Isomil® Soy Formula with Iron. 
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Usage: As a beverage for infants, children and adults with an allergy or 
sensitivity to cow's milk. A feeding for patients with disorders for which 
lactose should be avoided: lactase deficiency, lactose intolerance and 
galactosemia. 

5 Features: 

• Soy protein isolate to avoid symptoms of cow's-milk-protein 
allergy or sensitivity 

• Lactose-free formulation to avoid lactose-associated diarrhea 

• Low osmolaity (240 mOsm/kg water) to reduce risk of osmotic 
1 0 diarrhea. 

• Dual carbohydrates (corn syrup and sucrose) designed to 
enhance carbohydrate absorption and reduce the risk of exceeding the 
absorptive capacity of the damaged gut. 

• 1 .8 mg of Iron (as ferrous sulfate) per 1 00 Calories to help 
1 5 prevent iron deficiency. 

• Recommended levels of vitamins and minerals. 

• Vegetable oils to provide recommended levels of essential fatty 
acids. 

• Milk-white color, milk-like consistency and pleasant aroma. 

20 Ingredients: (Pareve, ©) 85% water, 4.9% corn syrup, 2.6% sugar 

(sucrose), 2.1% soy oil, 1.9% soy protein isolate, 1.4% coconut oil, 0.15% 
calcium citrate, 0.1 1 % calcium phosphate tribasic, potassium citrate, potassium 
phosphate monobasic, potassium chloride, mono- and disglycerides, soy 
lecithin, carrageenan, ascorbic acid, L-methionine, magnesium chloride, 

25 potassium phosphate dibasic, sodium chloride, choline chloride, taurine, ferrous 

sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L-carnitine, 
niacinamide, calcium pantothenate, cupric sulfate, vitamin A palmitate, 
thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 
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acid, manganese sulfate, potassium iodide, phylloquinone, biotin, sodium 
selenite, vitamin D3 and cyanocobalamin 

B. Isomil® DF Soy Formula For Diarrhea. 

Usage: As a short-term feeding for the dietary management of diarrhea 
5 in infants and toddlers. 

Features: 

• First infant formula to contain added dietary fiber from soy fiber 
specifically for diarrhea management. 

• Clinically shown to reduce the duration of loose, watery stools 
1 0 during mild to severe diarrhea in infants. 

• Nutritionally complete to meet the nutritional needs of the infant. 

• Soy protein isolate with added L-methionine meets or exceeds an 
infant's requirement for all essential amino acids. 

• Lactose-free formulation to avoid lactose-associated diarrhea. 

1 5 • Low osmolality (240 mOsm/kg water) to reduce the risk of 

osmotic diarrhea. 

• Dual carbohydrates (corn syrup and sucrose) designed to 
enhance carbohydrate absorption and reduce the risk of exceeding the 
absorptive capacity of the damaged gut. 

20 • Meets or exceeds the vitamin and mineral levels recommended 

by the Committee on Nutrition of the American Academy of Pediatrics 
and required by the Infant Formula Act. 

• 1 .8 mg of iron (as ferrous sulfate) per 1 00 Calories to help 
prevent iron deficiency. 

25 # Vegetable oils to provide recommended levels of essential fatty 

acids. 

Ingredients: (Pareve, ©) 86% water, 4.8% corn syrup, 2.5% sugar 
(sucrose), 2.1% soy oil, 2.0% soy protein isolate, 1.4% coconut oil, 0.77% soy 
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fiber, 0.12% calcium citrate, 0.1 1 % calcium phosphate tribasic, 0.10% 
potassium citrate, potassium chloride, potassium phosphate monobasic, mono- 
and disglycerides, soy lecithin, carrageenan, magnesium chloride, ascorbic acid, 
L-methionine, potassium phosphate dibasic, sodium chloride, choline chloride, 
5 taurine, ferrous sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L- 

carnitine, niacinamide, calcium pantothenate, cupric sulfate, vitamin A 
palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine 
hydrochloride, folic acid, manganese sulfate, potassium iodide, phylloquinone, 
biotin, sodium selenite, vitamin D3 and cyanocobalamin 

1 0 C. Isomil® SF Sucrose-Free Soy Formula With Iron. 

Usage: As a beverage for infants, children and adults with an allergy or 
sensitivity to cow's-milk protein or an intolerance to sucrose. A feeding for 
patients with disorders for which lactose and sucrose should be avoided. 

Features: 

1 5 • Soy protein isolate to avoid symptoms of cow's-milk-protein 

allergy or sensitivity. 

• Lactose-free formulation to avoid lactose-associated diarrhea 
(carbohydrate source is Polycose® Glucose Polymers). 

• Sucrose free for the patient who cannot tolerate sucrose. 

20 • Low osmolality ( 1 80 mOsm/kg water) to reduce risk of osmotic 

diarrhea. 

• 1 .8 mg of iron (as ferrous sulfate) per 100 Calories to help 
prevent iron deficiency. 

• Recommended levels of vitamins and minerals. 

25 • Vegetable oils to provide recommended levels of essential fatty 

acids. 

• Milk-white color, milk-like consistency and pleasant aroma. 

Ingredients: (Pareve, ©) 75% water, 1 1.8% hydrolized cornstarch, 4.1% 
soy oil, 4.1% soy protein isolate, 2.8% coconut oil, 1.0% modified cornstarch, 
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0.38% calcium phosphate tribasic, 0.17% potassium citrate, 0.13% potassium 
chloride, mono- and disglycerides, soy lecithin, magnesium chloride, abscorbic 
acid, L-methionine, calcium carbonate, sodium chloride, choline chloride, 
carrageenan, taurine, ferrous sulfate, m-inositol, alpha-tocopheryl acetate, zinc 
5 sulfate, L-carnitine, niacinamide, calcium pantothenate, cupric sulfate, vitamin 

A palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine 
hydrochloride, folic acid, manganese sulfate, potassium iodide, phylloquinone, 
biotin, sodium selenite, vitamin D3 and cyanocobalamin 

D, Isomil® 20 Soy Formula With Iron Ready To Feed, 
10 20Cal/floz. 

Usage: When a soy feeding is desired. 

Ingredients: (Pareve, ©) 85% water, 4.9% corn syrup, 2.6% sugar 
(sucrose), 2.1% soy oil, 1.9% soy protein isolate, 1.4% coconut oil, 0.15% 
calcium citrate, 0.1 1% calcium phosphate tribasic, potassium citrate, potassium 

15 phosphate monobasic, potassium chloride, mono- and disglycerides, soy 

lecithin, carrageenan, abscorbic acid, L-methionine, magnesium chloride, 
potassium phosphate dibasic, sodium chloride, choline chloride, taurine, ferrous 
sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L-carnitine, 
niacinamide, calcium pantothenate, cupric sulfate, vitamin A palmitate, 

20 thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 

acid, manganese sulfate, potassium iodide, phylloquinone, biotin, sodium 
selenite, vitamin D3 and cyanocobalamin. 

E. Similac® Infant Formula 

Usage: When an infant formula is needed: if the decision is made to 
25 discontinue breastfeeding before age 1 year, if a supplement to breastfeeding is 

needed or as a routine feeding if breastfeeding is not adopted. 



-110- 



BNSDOCID:<WO 9846764A1> 



WO 98/46764 PCT/US98/0742 1 



Features: 

• Protein of appropriate quality and quantity for good growth; 
heat-denatured, which reduces the risk of milk-associated enteric blood 
loss. 

5 • Fat from a blend of vegetable oils (doubly homogenized), 

providing essential linoleic acid that is easily absorbed. 

• Carbohydrate as lactose in proportion similar to that of human 
milk. 

• Low renal solute load to minimize stress on developing organs. 

1 0 • Powder, Concentrated Liquid and Ready To Feed forms. 

Ingredients: (®-D) Water, nonfat milk, lactose, soy oil, coconut oil, 
mono- and diglycerides, soy lecithin, abscorbic acid, carrageenan, choline 
chloride, taurine, m-inositol, alpha-tocopheryl acetate, zinc sulfate, niacinamid, 
ferrous sulfate, calcium pantothenate, cupric sulfate, vitamin A palmitate, 
1 5 thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 

acid, manganese sulfate, phylloquinone, biotin, sodium selenite, vitamin D 3 and 
cyanocobalamin 

F. Similac® NeoCare Premature Infant Formula With Iron 

Usage: For premature infants' special nutritional needs after hospital 
20 discharge. Similac NeoCare is a nutritionally complete formula developed to 

provide premature infants with extra calories, protein, vitamins and minerals 
needed to promote catch-up growth and support development. 

Features: 

• Reduces the need for caloric and vitamin supplementation. More 
25 calories (22 Cal/fl oz) then standard term formulas (20 Cal/fl oz). 

• Highly absorbed fat blend, with medium-chain triglycerides 
(MCT oil) to help meet the special digestive needs of premature infants. 

• Higher levels of protein, vitamins and minerals per 100 Calories 
to extend the nutritional support initiated in-hospital. 
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• More calcium and phosphorus for improved bone mineralization. 

Ingredients: ®-D Corn syrup solids, nonfat milk, lactose, whey protein 
concentrate, soy oil, high-oleic safflower oil, fractionated coconut oil (medium- 
chain triglycerides), coconut oil, potassium citrate, calcium phosphate tribasic, 
5 calcium carbonate, ascorbic acid, magnesium chloride, potassium chloride, 

sodium chloride, taurine, ferrous sulfate, m-inositol, choline chloride, ascorbyl 
palmitate, L-carnitine, alpha-tocopheryl acetate, zinc sulfate, niacinamide, 
mixed tocopherols, sodium citrate, calcium pantothenate, cupric sulfate, 
thiamine chloride hydrochloride, vitamin A palmitate, beta carotene, riboflavin, 
10 pyridoxine hydrochloride, folic acid, manganese sulfate, phylloquinone, biotin, 

sodium selenite, vitamin D3 and cyanocobalamin. 

G. Similac Natural Care Low-Iron Human Milk Fortifier Ready 
To Use, 24 Cal/fl 02. 

Usage: Designed to be mixed with human milk or to be fed alternatively 
1 5 with human milk to low-birth- weight infants. 

Ingredients: ®-D Water, nonfat milk, hydrolyzed cornstarch, lactose, 
fractionated coconut oil (medium-chain triglycerides), whey protein 
concentrate, soil oil, coconut oil, calcium phosphate tribasic, potassium citrate, 
magnesium chloride, sodium citrate, ascorbic acid, calcium carbonate, mono- 

20 and diglycerides, soy lecithin, carrageenan, choline chloride, m-inositol, taurine, 

niacinamide, L-carnitine, alpha tocopheryl acetate, zinc sulfate, potassium 
chloride, calcium pantothenate, ferrous sulfate, cupric sulfate, riboflavin, 
vitamin A palmitate, thiamine chloride hydrochloride, pyridoxine 
hydrochloride, biotin, folic acid, manganese sulfate, phylloquinone, vitamin D 3 , 

25 sodium selenite and cyanocobalamin. 

Various PUFAs of this invention can be substituted and/or added to the 
infant formulae described above and to other infant formulae known to those in 
the art.. 
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II. NUTRITIONAL FORMULATIONS 
A. ENSURE® 

Usage: ENSURE is a low-residue liquid food designed primarily as an 
oral nutritional supplement to be used with or between meals or, in appropriate 
5 amounts, as a meal replacement. ENSURE is lactose- and gluten-free, and is 

suitable for use in modified diets, including low-cholesterol diets. Although it 
is primarily an oral supplement, it can be fed by tube. 

Patient Conditions: 

• For patients on modified diets 

1 0 • For elderly patients at nutrition risk 

• For patients with involuntary weight loss 

• For patients recovering from illness or surgery 

• For patients who need a low-residue diet 
Ingredients: 

1 5 _ ®-D Water, Sugar (Sucrose), Maltodextrin (Corn), Calcium and Sodium 

Caseinates, High-Oleic Safflower Oil, Soy Protein Isolate, Soy Oil, Canola Oil, 
Potassium Citrate, Calcium Phosphate Tribasic, Sodium Citrate, Magnesium 
Chloride, Magnesium Phosphate Dibasic, Artificial Flavor, Sodium Chloride, 
Soy Lecithin, Choline Chloride, Ascorbic Acid, Carrageenan, Zinc Sulfate, 

20 Ferrous Sulfate, Alpha-Tocopheryl Acetate, Gellan Gum, Niacinamide, 

Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, Vitamin A 
Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 
Riboflavin, Folic Acid, Sodium Molybdate, Chromium Chloride, Biotin, 
Potassium Iodide, Sodium Selenate. 



25 



B. ENSURE® BARS 

Usage: ENSURE BARS are complete, balanced nutrition for 
supplemental use between or with meals. They provide a delicious, nutrient- 
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rich alternative to other snacks. ENSURE BARS contain <1 g lactose/bar, and 
Chocolate Fudge Brownie flavor is gluten-free. (Honey Graham Crunch flavor 
contains gluten.) 

Patient Conditions: 
5 • For patients who need extra calories, protein, vitamins and minerals 

• Especially useful for people who do not take in enough calories and 
nutrients 

• For people who have the ability to chew and swallow 

• Not to be used by anyone with a peanut allergy or any type of allergy to 
10 nuts. 

Ingredients: 

Honey Graham Crunch High-Fructose Corn Syrup, Soy Protein- 
Isolate, Brown Sugar, Honey, Maltodextrin (Corn), Crisp Rice (Milled Rice, 
Sugar [Sucrose], Salt [Sodium Chloride] and Malt), Oat Bran, Partially 
15 Hydrogenated Cottonseed and Soy Oils, Soy Polysaccharide, Glycerine, Whey 

Protein Concentrate, Polydextrose, Fructose, Calcium Caseinate, Cocoa 
Powder, Artificial Flafors, Canola Oil, High-Oleic Safflower Oil, Nonfat Dry 
Milk, Whey Powder, Soy Lecithin and Corn Oil. Manufactured in a facility that 
processes nuts. 

20 Vitamins and Minerals: 

Calcium Phosphate Tribasic, Potassium Phosphate Dibasic, Magnesium 
Oxide, Salt (Sodium Chloride), Potassium Chloride, Ascorbic Acid, Ferric 
Orthophosphate, Alpha-Tocopheryl Acetate, Niacinamide, Zinc Oxide, Calcium 
Pantothenate, Copper Gluconate, Manganese Sulfate, Riboflavin, Beta- 
25 Carotene, Pyridoxine Hydrochloride, Thiamine Mononitrate, Folic Acid, Biotin, 

Chromium Chloride, Potassium Iodide, Sodium Selenate, Sodium Molybdate, 
Phylloquinone, Vitamin D 3 and Cyanocobalamin. 
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Protein: 

Honey Graham Crunch - The protein source is a blend of soy protein isolate 
and milk proteins. 

Soy protein isolate 74% 
5 Milk proteins 26% 

Fat: 

Honey Graham Crunch - The fat source is a blend of partially 
hydrogenated cottonseed and soybean, canola, high oleic safflower, and corn 
oils, and soy lecithin. 

1 0 Partially hydrogenated cottonseed and soybean oil 76% 



Canola oil 8% 

High-oleic safflower oil 8% 

Corn oil 4% 

Soy lecithin 4% 

15 Carbohydrate: 



Honey Graham Crunch - The carbohydrate source is a combination of 
high-fructose corn syrup, brown sugar, maltodextrin, honey, crisp rice, 
glycerine, soy polysaccharide, and oat bran. 



High-fructose corn syrup 24% 

20 Brown sugar 2 1 % 

Maltodextrin 1 2% 

Honey i\o /o 

Crisp rice 9% 

Glycerine 9% 

25 Soy polysaccharide 7% 

Oat bran 7%\ 
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C. ENSURE® HIGH PROTEIN 

Usage: ENSURE HIGH PROTEIN is a concentrated, high-protein 
liquid food designed for people who require additional calories, protein, 
vitamins, and minerals in their diets. It can be used as an oral nutritional 
5 supplement with or between meals or, in appropriate amounts, as a meal 

replacement. ENSURE HIGH PROTEIN is lactose- and gluten-free, and is 
suitable for use by people recovering from general surgery or hip fractures and 
by patients at risk for pressure ulcers. 

Patient Conditions 

10 • For patients who require additional calories, protein, vitamins, and minerals, 

such as patients recovering from general surgery or hip fractures, patients at risk 
for pressure ulcers, and patients on low-cholesterol diets 

Features- 

• Low in saturated fat 

15 • Contains 6 g of total fat and < 5 mg of cholesterol per serving 

• Rich, creamy taste 

• Excellent source of protein, calcium, and other essential vitamins and 
minerals 

• For low-cholesterol diets 
20 • Lactose-free, easily digested 

Ingredients: 

Vanilla Supreme: -®-D Water, Sugar (Sucrose), Maltodextrin (Corn), Calcium 
and Sodium Caseinates, High-OIeic Safflower Oil, Soy Protein Isolate, Soy Oil, 
Canola Oil, Potassium Citrate, Calcium Phosphate Tribasic, Sodium Citrate, 
25 Magnesium Chloride, Magnesium Phosphate Dibasic, Artificial Flavor, Sodium 

Chloride, Soy Lecithin, Choline Chloride, Ascorbic Acid, Carrageenan, Zinc 
Sulfate, Ferrous Suffate, Alpha-Tocopheryl Acetate, Gellan Gum, Niacinamide, 
Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, Vitamin A 
Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 
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Riboflavin, Folio Acid, Sodium Motybdate, Chromium Chloride, Biotin, 
Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin D3 and 
Cyanocobalarnin. 

Protein: 

5 The protein source is a blend of two high-biologic-value proteins: casein and 

soy. 

Sodium and calcium caseinates 85% 
Soy protein isolate 1 5% 

Fat: 

10 The fat source is a blend of three oils: high-oleic safflower, canola, and soy. 

High-oleic safflower oil 40% 
Canola oil 30% 
Soy oil 30% 

The level of fat in ENSURE HIGH PROTEIN meets American Heart 
1 5 Association (AHA) guidelines. The 6 grams of fat in ENSURE HIGH 

PROTEIN represent 24% of the total calories, with 2.6% of the fat being from 
saturated fatty acids and 7.9% from polyunsaturated fatty acids. These values 
are within the AHA guidelines of < 30% of total calories from fat, < 1 0% of the 
calories from saturated fatty acids, and < 1 0% of total calories from 
20 polyunsaturated fatty acids. 

Carbohydrate: 

ENSURE HIGH PROTEIN contains a combination of maltodextrin and 
sucrose. The mild sweetness and flavor variety (vanilla supreme, chocolate 
royal, wild berry, and banana), plus VARI-FLAVORSO® Flavor Pacs in pecan, 
25 cherry, strawberry, lemon, and orange, help to prevent flavor fatigue and aid in 

patient compliance. 

Vanilla and other nonchocolate flavors 

Sucrose 60% 
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Maltodextrin 



40% 



Chocolate 



Sucrose 



70% 



Maltodextrin 



30% 



10 



15 



20 



25 
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D. ENSURE ® LIGHT 

Usage: ENSURE LIGHT is a low-fat liquid food designed for use as an 
oral nutritional supplement with or between meals. ENSURE LIGHT is 
lactose- and gluten-free, and is suitable for use in modified diets, including low- 
cholesterol diets. 

Patient Conditions: 

• For normal-weight or overweight patients who need extra nutrition in a 
supplement that contains 50% less fat and 20% fewer calories than ENSURE 

• For healthy adults who don't eat right and need extra nutrition 
Features: 

• Low in fat and saturated fat 

• Contains 3 g of total fat per serving and < 5 mg cholesterol 

• Rich, creamy taste 

• Excellent source of calcium and other essential vitamins and minerals 

• For low-cholesterol diets 

• Lactose-free, easily digested 
Ingredients: 

French Vanilla: ©-D Water, Maltodextrin (Corn), Sugar (Sucrose), Calcium 
Caseinate, High-Oleic Safflower Oil, Canola Oil, Magnesium Chloride, Sodium 
Citrate, Potassium Citrate, Potassium Phosphate Dibasic, Magnesium Phosphate 
Dibasic, Natural and Artificial Flavor, Calcium Phosphate Tribasic, Cellulose 
Gel,. Choline Chloride, Soy Lecithin, Carrageenan, Salt (Sodium Chloride), 
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Ascorbic Acid, Cellulose Gum, Ferrous Sulfate, Alpha-Tocopheryl Acetate, 
Zinc Sulfate, Niacinamide, Manganese Sulfate, Calcium Pantothenate, Cupric 
Sulfate, Thiamine Chloride Hydrochloride, Vitamin A Palmitate, Pyridoxine 
Hydrochloride, Riboflavin, Chromium Chloride, Folic Acid, Sodium 
5 Molybdate, Biotin, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin 

D 3 and Cyanocobalamin. 

Protein: 

The protein source is calcium caseinate. 

Calcium caseinate 100% 

10 Fat 

The fat source is a blend of two oils: high-oleic safflower and canola. 
High-oleic safflower oil 70% 
Canola oil 30% 

The level of fat in ENSURE LIGHT meets American Heart Association 
15 (AHA) guidelines. The 3 grams of fat in ENSURE LIGHT represent 13.5% of 

the total calories, with 1 .4% of the fat being from saturated fatty acids and 2.6% 
from polyunsaturated fatty acids. These values are within the AHA guidelines 
of < 30% of total calories from fat, < 1 0% of the calories from saturated fatty 
acids, and < 1 0% of total calories from polyunsaturated fatty acids. 

20 Carbohydrate 

ENSURE LIGHT contains a combination of maltodextrin and sucrose. 
The chocolate flavor contains corn syrup as well. The mild sweetness and 
flavor variety (French vanilla, chocolate supreme, strawberry swirl), plus 
VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, lemon, and 
25 orange, help to prevent flavor fatigue and aid in patient compliance. 

Vanilla and other nonchocolate flavors 

Sucrose 51% 
Maltodextrin 49o/ 0 
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Chocolate 

Sucrose 47.0% 

Corn Syrup 26.5% 

Maltodextrin 26.5% 
5 Vitamins and Minerals 

An 8-fl-oz serving of ENSURE LIGHT provides at least 25% of the 
RDIs for 24 key vitamins and minerals. 

Caffeine 

Chocolate flavor contains 2.1 mg caffeine/8 fl oz. 

10 

E. ENSURE PLUS® 

Usage: ENSURE PLUS is a high-calorie, low-residue liquid food for 
use when extra calories and nutrients, but a normal concentration of protein, are 
needed. It is designed primarily as an oral nutritional supplement to be used 
1 5 with or between meals or, in appropriate amounts, as a meal replacement. 

ENSURE PLUS is lactose- and gluten-free. Although it is primarily an oral 
nutritional supplement, it can be fed by tube. 

Patient Conditions: 

• For patients who require extra calories and nutrients, but a normal 
20 concentration of protein, in a limited volume 

• For patients who need to gain or maintain healthy weight 
Features 

• Rich, creamy taste 

• Good source of essential vitamins and minerals 
25 Ingredients 

Vanilla: ©-D Water, Corn Syrup, Maltodextrin (Corn), Corn Oil, Sodium and 
Calcium Caseinates, Sugar (Sucrose), Soy Protein Isolate, Magnesium Chloride, 
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Potassium Citrate, Calcium Phosphate Tribasic, Soy Lecithin, Natural and 
Artificial Flavor, Sodium Citrate, Potassium Chloride, Choline Chloride, 
Ascorbic Acid, Carrageenan, Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl 
Acetate, Niacinamide, Calcium Pantothenate, Manganese Sulfate, Cupric 
5 Sulfate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrpchloride, 

Riboflavin, Vitamin A Palmitate, Folic Acid, Biotin, Chromium Chloride, 
Sodium Molybdate, Potassium Iodide, Sodium Selenite, Phylloquinone, 
Cyanocobalamin and Vitamin D 3 . 

Protein 

10 The protein source is a blend of two high-biologic-value proteins: casein 

and soy. 

Sodium and calcium caseinates 84% 
Soy protein isolate 1 6% 

Fat 

1 5 The fat source is corn oil. 

Com oil 100% 
Carbohydrate 

ENSURE PLUS contains a combination of maltodextrin and sucrose. 
The mild sweetness and flavor variety (vanilla, chocolate, strawberry, coffee, 
20 buffer pecan, and eggnog), plus VARI-FLAVORS® Flavor Pacs in pecan, 

cherry, strawberry, lemon, and orange, help to prevent flavor fatigue and aid in 
patient compliance. 

Vanilla, strawberry, butter pecan, and coffee flavors 



Com Syrup 39% 

25 Maltodextrin 38% 

Sucrose 23% 
Choc late and eggnog flavors 

Corn Syrup 36% 



-121- 



BNSDOCID:<WO 9S46764A1> 



WO 98/46764 



PCT/US98/07421 



Maltodextrin 34% 
Sucrose 20% 
Vitamins and Minerals 

An 8-fl-oz serving of ENSURE PLUS provides at least 15% of the RDIs 
for 25 key Vitamins and minerals. 

Caffeine 

Chocolate flavor contains 3.1 mg Caffeine/8 fl oz. Coffee flavor 
contains a trace amount of caffeine. 



1 0 F. ENSURE PLUS® HN 

Usage: ENSURE PLUS HN is a nutritionally complete high-calorie, 
high-nitrogen liquid food designed for people with higher calorie and protein 
needs or limited volume tolerance. It may be used for oral supplementation or 
for total nutritional support by tube. ENSURE PLUS HN is lactose- and gluten- 
15 free. 

Patient Conditions: 

• For patients with increased calorie and protein needs, such as following 
surgery or injury 

• For patients with limited volume tolerance and early satiety 
20 Features 

• For supplemental or total nutrition 

• For oral or tube feeding 

• 1.5CaVmL 

• High nitrogen 
25 • Calorically dense 
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Ingredients 

Vanilla: ©-D Water, Maltodextrin (Corn), Sodium and Calcium Caseinates, 
Corn Oil, Sugar (Sucrose), Soy Protein Isolate, Magnesium Chloride, Potassium 
Citrate, Calcium Phosphate Tribasic, Soy Lecithin, Natural and Artificial 
Flavor, Sodium Citrate, Choline Chloride, Ascorbic Acid, Taurine, L-Carnitine, 
Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Niacinamide, 
Carrageenan, Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, 
Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, Riboflavin, 
Vitamin A Palmitate, Folic Acid, Biotin, Chromium Chloride, Sodium 
Molybdate, Potassium Iodide, Sodium Selenite, Phylloquinone, 
Cyanocobalamin and Vitamin D 3 . 



G. ENSURE® POWDER 

Usage: ENSURE POWDER (reconstituted with water) is a low-residue 
1 5 liquid food designed primarily as an oral nutritional supplement to be used with 

or between meals. ENSURE POWDER is lactose- and gluten-free, and is 
suitable for use in modified diets, including low-cholesterol diets. 

Patient Conditions: 

• For patients on modified diets 

20 • For elderly patients at nutrition risk 

• For patients recovering from illness/surgery 

• For patients who need a low-residue diet 
Features 

• Convenient, easy to mix 
25 • Low in saturated fat 

• Contains 9 g of total fat and < 5 mg of cholesterol per serving 

• High in vitamins and minerals 

• For low-cholesterol diets 
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• Lactose-free, easily digested 

Ingredients: ©-D Corn Syrup, Maitodextrin (Corn), Sugar (Sucrose), Corn Oil, 
Sodium and Calcium Caseinates, Soy Protein Isolate, Artificial Flavor, 
Potassium Citrate, Magnesium Chloride, Sodium Citrate, Calcium Phosphate 
5 Tribasic, Potassium Chloride, Soy Lecithin, Ascorbic Acid, Choline Chloride, 

Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Niacinamide, 
Calcium Pantothenate, Manganese Sulfate, Thiamine Chloride Hydrochloride, 
Cupric Sulfate, Pyridoxine Hydrochloride, Riboflavin, Vitamin A Palmitate, 
Folic Acid, Biotin, Sodium Molybdate, Chromium Chloride, Potassium Iodide, 
10 Sodium Selenate, Phylloquinone, Vitamin D 3 and Cyanocobalamin. 

Protein 

The protein source is a blend of two high-biologic-value proteins: casein 
and soy. 

Sodium and calcium caseinates 84% 
1 5 Soy protein isolate 1 6% 

Fat 

The fat source is com oil. 

Com oil 100% 
Carbohydrate 

20 ENSURE POWDER contains a combination of com syrup, 

maitodextrin, and sucrose. The mild sweetness of ENSURE POWDER, plus 
VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, lemon, and 
orange, helps to prevent flavor fatigue and aid in patient compliance. 



Vanilla 

25 Com Syrup 35<>/ 0 

Maitodextrin 3 5% 

Sucrose 30% 
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H. ENSURE® PUDDING 

Usage: ENSURE PUDDING is a nutrient-dense supplement providing 
balanced nutrition in a nonliquid form to be used with or between meals. It is 
appropriate for consistency-modified diets (e.g., soft, pureed, or full liquid) or 
5 for people with swallowing impairments. ENSURE PUDDING is gluten-free. 

Patient Conditions: 

• For patients on consistency-modified diets (e.g., soft, pureed, or full liquid) 

• For patients with swallowing impairments 
Features 

10 • Rich and creamy, good taste 

• Good source of essential vitamins and minerals Convenient-needs no 
refrigeration 

• Gluten-free 

Nutrient Profile per 5 oz: Calories 250, Protein 10.9%, Total Fat 34.9%, 
1 5 Carbohydrate 54.2% 

Ingredients: 

Vanilla: ©-D Nonfat Milk, Water, Sugar (Sucrose), Partially Hydrogenated 
Soybean Oil, Modified Food Starch, Magnesium Sulfate. Sodium Stearoyl 
Lactylate, Sodium Phosphate Dibasic, Artificial Flavor, Ascorbic Acid, Zinc 

20 Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Choline Chloride, 

Niacinamide, Manganese Sulfate, Calcium Pantothenate, FD&C Yellow #5, 
Potassium Citrate, Cupric Sulfate, Vitamin A Palmitate, Thiamine Chloride 
Hydrochloride, Pyridoxine Hydrochloride, Riboflavin, FD&C Yellow #6, Folic 
Acid, Biotin, Phylloquinone, Vitamin D3 and Cyanocobalamin. 

25 Protein 

The protein source is nonfat milk. 

Nonfat milk 1 00% 
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Fat 

The fat source is hydrogenated soybean oil. 
Hydrogenated soybean oil 1 00% 

Carbohydrate 

ENSURE PUDDING contains a combination of sucrose and modified 
food starch. The mild sweetness and flavor variety (vanilla, chocolate, 
butterscotch, and tapioca) help prevent flavor fatigue. The product contains 9.2 
grams of lactose per serving. 

Vanilla and other nonchocolate flavors 



10 Sucrose 56©^ 

Lactose 27% 

Modified food starch 1 7% 
Chocolate 

Sucrose 58% 

1 5 Lactose 26% 

Modified food starch 16% 



I. ENSURE® WITH FIBER 

Usage: ENSURE WITH FIBER is a fiber-containing, nutritionally 
complete liquid food designed for people who can benefit from increased 
dietary fiber and nutrients. ENSURE WITH FIBER is suitable for people who 
do not require a low-residue diet. It can be fed orally or by tube, and can be 
used as a nutritional supplement to a regular diet or, in appropriate amounts, as 
a meal replacement. ENSURE WITH FIBER is lactose- and gluten-free, and is 
suitable for use in modified diets, including low-cholesterol diets. 

Patient Conditions 

• For patients who can benefit from increased dietary fiber and nutrients 
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Features 

• New advanced formula-low in saturated fat, higher in vitamins and minerals 

• Contains 6 g of total fat and < 5 mg of cholesterol per serving 

• Rich, creamy taste 

5 • Good source of fiber 

• Excellent source of essential vitamins and minerals 

• For low-cholesterol diets 

• Lactose- and gluten-free 
Ingredients 

10 Vanilla: ©-D Water, Maltodextrin (Corn), Sugar (Sucrose), Sodium and 

Calcium Caseinates, Oat Fiber, High-Oleic Safflower Oil, Canola Oil, Soy 
Protein Isolate, Corn Oil, Soy Fiber, Calcium Phosphate Tribasic, Magnesium 
Chloride, Potassium Citrate, Cellulose Gel, Soy Lecithin, Potassium Phosphate 
Dibasic, Sodium Citrate, Natural and Artificial Flavors, Choline Chloride, 

15 Magnesium Phosphate, Ascorbic Acid, Cellulose Gum, Potassium Chloride, 

Carrageenan, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Zinc Sulfate, 
Niacinamide, Manganese Sulfate, Calcium Pantothenate, Cupric Sulfate, 
Vitamin A Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine 
Hydrochloride, Riboflavin, Folic Acid, Chromium Chloride, Biotin, Sodium 

20 Molybdate, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin D 3 and 

Cyanocobalamin. 

Protein 

The protein source is a blend of two high-biologic-value proteins- casein 
and soy. 



25 Sodium and calcium caseinates 80°/c 

Soy protein isolate 20% 



r 0 



f 0 
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Fat 

The fat source is a blend of three oils: high-oleic safflower, canola, and 



corn. 

High-oleic safflower oil 40% 

5 Canola oil 40% 

Corn oil 20% 



The level of fat in ENSURE WITH FIBER meets American Heart 
Association (AHA) guidelines. The 6 grams of fat in ENSURE WITH FIBER 
represent 22% of the total calories, with 2.01 % of the fat being from saturated 
fatty acids and 6.7% from polyunsaturated fatty acids. These values are within 
the AHA guidelines of < 30% of total calories from fat, < 1 0% of the calories 
from saturated fatty acids, and < 1 0% of total calories from polyunsaturated 
fatty acids. 

Carbohydrate 

ENSURE WITH FIBER contains a combination of maltodextrin and 
sucrose. The mild sweetness and flavor variety (vanilla, chocolate, and butter 
pecan), plus VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, 
lemon, and orange, help to prevent flavor fatigue and aid in patient compliance. 
Vanilla and other nonchocolate flavors 



20 Maltodextrin 66<>/ 0 

Sucrose 25% 

Oat Fiber 7% 

Soy Fiber 2% 
Chocolate 

25 Maltodextrin 55% 

Sucrose 26% 

Oat Fiber 7% 
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Soy Fiber 2% 

Fiber 

The fiber blend used in ENSURE WITH FIBER consists of oat fiber and 
soy polysaccharide. This blend results in approximately 4 grams of total dietary 
5 fiber per 8-fl-oz can. The ratio of insoluble to soluble fiber is 95 :5. 

The various nutritional supplements described above and known to 
others of skill in the art can be substituted and/or supplemented with the PUFAs 
of this invention. 

J- Oxepa™ Nutritional Product 

1 0 Oxepa is low-carbohydrate, calorically dense enteral nutritional product 

designed for the dietary management of patients with or at risk for ARDS. It 
has a unique combination of ingredients, including a patented oil blend 
containing eicosapentaenoic acid (EPA from fish oil), y-Hnolenic acid (GLA 
from borage oil), and elevated antioxidant levels. 

1 5 Caloric Distribution: 

• Caloric density is high at 1.5 Cal/mL (355 Cal/8 fl oz), to minimize the 
volume required to meet energy needs. 



The distribution of Calories in Oxepa is shown in Table 7. 



Table 7. Caloric Distribution of Oxepa 




per 8 fl oz. 


per liter 


%ofCal 


Calories 


355 


1,500 




Fat(g) 


22.2 


93.7 


55.2 


Carbohydrate (g) 


25 


105.5 


28.1 


Protein (g) 


14.8 


62.5 


16.7 


Water (g) 


186 


785 





20 Fat: 

• Oxepa contains 22.2 g of fat per 8-fl oz serving (93 .7 g/L). 

• The fat source is a oil blend of 3 1 .8% canola oil, 25% medium-chain 
triglycerides (MCTs), 20% borage oil, 20% fish oil, and 3.2 % soy lecithin. The 
typical fatty acid profile of Oxepa is shown in Table 8. 
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• Oxepa provides a balanced amount of polyunsaturated, monounsaturated, 
and saturated fatty acids, as shown in Table 10. 

• Medium-chain trigylcerides (MCTs) ~ 25% of the fat blend - aid gastric 
emptying because they are absorbed by the intestinal tract without 

5 emulsification by bile acids. 

The various fatty acid components of Oxepa™ nutritional product can 



10 



be substituted and/or supplemented with the PUFAs of this invention. 


Table 8. Typical Fatty Acid Profile 




ft J *T* ^ t If™* 

% Total Fatty 
Acids 


g/8 fl oz* 


g/L* 


Caproic (6:0) 


0.2 


0.04 


0.18 


Caprylic (8:0) 


14.69 


3.1 


13.07 


Capric(10:0) 


11.06 


2.33 


9.87 


Palmitic (16:0) 


5.59 


1.18 


4.98 


Palmitoleic(16:ln-7) 


1.82 


0.38 


1.62 


Stearic (18:0) 


1.84 


0.39 


1.64 


Oleic(18:ln-9) 


24.44 


5.16 


21.75 


Linoleic (18:2n-6) 


16.28 


3.44 


14.49 


cc-Linolenic (18:3n-3) 


3.47 


0.73 


3.09 


Y-Linolenic (18:3n-6) 


4.82 


1.02 


4.29 


Eicosapentaenoic (20:5n- 
3) 


5.11 


1.08 


4.55 


n-3-Docosapentaenoic 
(22:5n-3) 


0.55 


0.12 


0.49 


Docosahexaenoic (22:6n- 
3) 


2.27 


0.48 


2.02 


Others 


7.55 


1.52 


6.72 


fatty acids equal approximately 95% of total 


fat. 




Table 9. Fat Profile of Oxepa. 


Vo ot total calories trom fat 


55.2 


Polyunsaturated fatty acids 


31.44 g/L 


Monounsaturated fatty acids 


25.53 g/L 


Saturated fatty acids 


32.38 g/L 


n-6 to n-3 ratio 


1.75:1 


Cholesterol 


9.49 mg/8 fl oz 
40.1 mg/L 
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Carbohydrate: 

• The carbohydrate content is 25.0 g per 8-fl-oz serving (105.5 g/L). 

© The carbohydrate sources are 45% maltodextrin (a complex carbohydrate) 
and 55% sucrose (a simple sugar), both of which are readily digested and 
5 absorbed. 

• The high-fat and low-carbohydrate content of Oxepa is designed to 
minimize carbon dioxide (CO2) production. High CO2 levels can complicate 
weaning in ventilator-dependent patients. The low level of carbohydrate also 
may be useful for those patients who have developed stress-induced 

1 0 hyperglycemia. 

• Oxepa is lactose-free. 

Dietary carbohydrate, the amino acids from protein, and the glycerol 
moiety of fats can be converted to glucose within the body. Throughout this 
process, the carbohydrate requirements of glucose-dependent tissues (such as 

1 5 the central nervous system and red blood cells) are met. However, a diet free of 

carbohydrates can lead to ketosis, excessive catabolism of tissue protein, and 
loss of fluid and electrolytes. These effects can be prevented by daily ingestion 
of 50 to 100 g of digestible carbohydrate, if caloric intake is adequate. The 
carbohydrate level in Oxepa is also sufficient to minimize gluconeogenesis, if 

20 energy needs are being met. 

Protein: 

• Oxepa contains 14.8 g of protein per 8-fl-oz serving (62.5 g/L). 

• The total calorie/nitrogen ratio (150:1) meets the need of stressed patients. 

• Oxepa provides enough protein to promote anabolism and the maintenance 
25 of lean body mass without precipitating respiratory problems. High protein 

intakes are a concern in patients with respiratory insufficiency. Although 
protein has little effect on C0 2 production, a high protein diet will increase 
ventilatory drive. 
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• The protein sources of Oxepa are 86.8% sodium caseinate and 13.2% 
calcium caseinate. 

• As demonstrated in Table 1 1, the amino acid profile of the protein system in 
Oxepa meets or surpasses the standard for high quality protein set by 

5 theNational Academy of Sciences. 

• Oxepa is gluten-free. 

All publications and patent applications mentioned in this specification 
are indicative of the level of skill of those skilled in the art to which this 
0 invention pertains. All publications and patent applications are herein 

incorporated by reference to the same extent as if each individual publication or 
patent application was specifically and individually indicated to be incorporated 
by reference. 

The invention now being fully described, it will be apparent to one of 
5 ordinary skill in the art that many changes and modifications can be made 

thereto without departing from the spirit or scope of the appended claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: KNUTZON, DEBORAH 
MURKER JI, PRADIP 
HUANG, YUNG-SHENG 
THURMOND/ JENNIFER 
CHAUDHARY, SUNITA 
LEONARD, AMANDA 

15 <ii) TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR SYNTHESIS 

OF LONG CHAIN POLY -UNSATURATED FATTY ACIDS IN PLANTS 

(iii) NUMBER OF SEQUENCES: 52 

20 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LIMBACH & LIMBACH L.L.P. 

(B) STREET: 2001 FERRY BUILDING 

(C) CITY: SAN FRANCISCO 

(D) STATE: CA 

25 (E) COUNTRY: USA 

(F) ZIP: 94111 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

30 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS-DOS 

(D) SOFTWARE r Microsoft Word 

(vi) CURRENT APPLICATION DATA: 
35 (A) APPLICATION NUMBER: 

<B) FILING DATE: 
( C ) CLASS I FI CATION : 

(vii) PRIOR APPLICATION DATA: 
40 (A) APPLICATION NUMBER: US 08/834,033 

(B) FILING DATE: ll-APR-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/833,610 
45 (B) FILING DATE: ll-APR-1997 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: MICHAEL R. WARD 

(B) REGISTRATION NUMBER: 38,351 

50 (C> REFERENCE/DOCKET NUMBER: CGAB-320 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 433-4150 

(B) TELEFAX: (415) 433-8716 
55 (C) TELEX: N/A 

(2) INFORMATION FOR SEQ ID NO:l: 

60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1617 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 





CGACACTCCT 


TCCTTCTTCT 


CACCCGTCCT 


AGTCCCCTTC 


AACCCCCCTC 


TTTGACAAAG 


60 


1 < 

1 J 


ACAACAAACC 


ATGGCTGCTG 


CTCCCAGTGT 


GAGGACGTTT 


ACTCGGGCCG 


AGGTTTTGAA 


120 




TGCCGAGGCT 


CTGAATGAGG 


GCAAGAAGGA 


TGCCGAGGCA 


CCCTTCTTGA 


TGATCATCGA 


180 




CAACAAGGTG 


TACGATGTCC 


GCGAGTTCGT 


CCCTGATCAT 


CCCGGTGGAA 


GTGTGATTCT 


240 


20 


CACGCACGTT 


GGCAAGGACG 


GCACTGACGT 


CTTTGACACT 


TTTCACCCCG 


AGGCTGCTTG 


300 




GGAGACTCTT 


GCCAACTTTT 


ACGTTGGTGA 


TATTGACGAG 


AGCGACCGCG 


ATATCAAGAA 


360 


25 


TGATGACTTT 


GCGGCCGAGG 


TCCGCAAGCT 


GCGTACCTTG 


TTCCAGTCTC 


TTGGTTACTA 


420 




CGATTCTTCC 


AAGGCATACT 


ACGCCTTCAA 


GGTCTCGTTC 


AACCTCTGCA 


TCTGGGGTTT 


480 




GTCGACGGTC 


ATTGTGGCCA 


AGTGGGGCCA 


GACCTCGACC CTCGCCAACG 


TGCTCTCGGC 


540 


30 


TGCGCTTTTG 


GGTCTGTTCT 


GGCAGCAGTG 


CGGATGGTTG 


GCTCACGACT 


TTTTGCATCA 


600 




CCAGGTCTTC 


CAGGACCGTT 


TCTGGGGTGA 


TCTTTTCGGC 


GCCTTCTTGG 


GAGGTGTCTG 


660 


35 


CCAGGGCTTC 


TCGTCCTCGT 


GGTGGAAGGA 


CAAGCACAAC 


ACTCACCACG 


CCGCCCCCAA 


720 




CGTCCACGGC 


GAGGATCCCG 


AC AT TG AC AC 


CCACCCTCTG 


TTGACCTGGA 


GTGAGCATGC 


780 




GTTGGAGATG 


TTCTCGGATG 


TCCCAGATGA 


GGAGCTGACC 


CGCATGTGGT 


CGCGTTTCAT 


840 


40 


GGTCCTGAAC 


CAGACCTGGT 


TTTACTTCCC 


CATTCTCTCG 


TTTGCCCGTC 


TCTCCTGGTG 


900 




CCTCCAGTCC 


ATTCTCTTTG 


TGCTGCCTAA 


CGGTCAGGCC 


CACAAGCCCT 


CGGGCGCGCG 


960 


45 


TGTGCCCATC 


TCGTTGGTCG 


AGCAGCTGTC 


GCTTGCGATG 


CACTGGACCT 


GGTACCTCGC 


1020 




CACCATGTTC 


CTGTTCATCA 


AGGATCCCGT 


CAACATGCTG 


GTGTACTTTT 


TGGTGTCGCA 


1080 




GGCGGTGTGC 


GGAAACTTGT 


TGGCGATCGT 


GTTCTCGCTC 


AACCACAACG 


GTATGCCTGT 


1140 


50 


GATCTCGAAG 


GAGGAGGCGG 


TCGATATGGA 


TTTCTTCACG 


AAGCAGATCA 


TCACGGGTCG 


1200 




TGATGTCCAC 


CCGGGTCTAT 


TTGCCAACTG 


GTTCACGGGT 


GGATTGAACT 


ATCAGATCGA 


1260 


55 


GCACCACTTG 


TTCCCTTCGA 


TGCCTCGCCA 


CAACTTTTCA 


AAGATCCAGC 


CTGCTGTCGA 


1320 




GACCCTGTGC 


AAAAAGTACA 


ATGTCCGATA 


CCACACCACC 


GGTATGATCG 


AGGGAACTGC 


1380 




AGAGGTCTTT 


AGCCGTCTGA 


ACGAGGTCTC 


CAAGGCTGCC 


TCCAAGATGG 


GTAAGGCGCA 


1440 


60 


GTAAAAAAAA 


AAACAAGGAC 


GTTTTTTTTC 


GCCAGTGCCT 


GTGCCTGTGC 


CTGCTTCCCT 


1500 




TGTCAAGTCG 


AGCGTTTCTG 


GAAAGGATCG 


TTCAGTGCAG 


TATCATCATT 


CTCCTTTTAC 


1560 
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15 



20 



30 



35 



40 



45 



55 



60 



CCCCCGCTCA TATCTCATTC ATTTCTCTTA TTAAACAACT TGTTCCCCCC TTCACCG 1617 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 57 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu 
1 5 10 15 

~ c Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe 

25 20 25 30 

Leu Met He He Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro 
35 40 45 

Asp His Pro Gly Gly Ser Val He Leu Thr His Val Gly Lys Asp Gly 
50 55 60 

Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu 
65 7 0 75 80 

Ala Asn Phe Tyr Val Gly Asp He Asp Glu Ser Asp Arg Asp He Lys 
85 90 95 

Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gin 
100 105 110 

Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val 
115 120 125 

Ser Phe Asn Leu Cys He Trp Gly Leu Ser Thr Val He Val Ala Lys 
130 135 140 

Trp Gly Gin Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu 
50 145 150 155 160 

Gly Leu Phe Trp Gin Gin Cys Gly Trp Leu Ala His Asp Phe Leu His 
165 170 175 



His Gin Val Phe Gin Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe 
180 185 190 

Leu Gly Gly Val Cys Gin Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys 
195 200 205 

His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp 
210 215 220 
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10 



15 



25 



30 



35 



40 



45 



55 



60 



He Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met 
225 230 235 240 

Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe 
2 <5 250 255 

Met Val Leu Asn Gin Thr Trp Phe Tyr Phe Pro He Leu Ser Phe Ala 
260 265 270 

Arg Leu Ser Trp Cys Leu Gin Ser He Leu Phe Val Leu Pro Asn Gly 
275 280 285 

Gin Ala His Lys Pro Ser Gly Ala Arg Val Pro He Ser Leu Val Glu 
290 295 300 

Gin Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe 
305 310 315 320 



„ Leu phe Ile L ys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser 

20 325 330 335 



Gin Ala Val Cys Gly Asn Leu Leu Ala Ile Val Phe Ser Leu Asn His 
340 345 350 

Asn Gly Met Pro Val He Ser Lys Glu Glu Ala Val Asp Met Asp Phe 
355 360 365 

Phe Thr Lys Gin He Ile Thr Gly Arg Asp Val His Pro Gly Leu Phe 
370 375 380 

Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gin Ile Glu His His Leu 
385 390 395 400 

Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys Ile Gin Pro Ala Val 
405 410 415 

Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met 
420 425 430 

Ile Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys 
435 440 445 

Ala Ala Ser Lys Met Gly Lys Ala Gin 
450 455 

(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1488 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
GTCCCCTGTC GCTGTCGGCA CACCCCATCC TCCCTCGCTC CCTCTGCGTT TGTCCTTGGC 60 
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CCACCGTCTC 


TCCTCCACCC 


TCCGAGACGA 


CTGCAACTGT 


AATCAGGAAC 


CGACAAATAC 


120 


ACGATTTCTT 


TTTACTCAGC 


ACCAACTCAA 


AATCCTCAAC 


CGCAACCCTT 


TTTCAGGATG 


180 


GCACCTCCCA 


ACACTATCGA 


TGCCGGTTTG 


ACCCAGCGTC 


ATATCAGCAC 


CTCGGCCCCA 


240 


AACTCGGCCA 


AGCCTGCCTT 


CGAGCGCAAC 


TACCAGCTCC 


CCGAGTTCAC 


CATCAAGGAG 


300 


ATCCGAGAGT 


GCATCCCTGC 


CCACTGCTTT 


GAGCGCTCCG 


GTCTCCGTGG 


TCTCTGCCAC 


360 


GTTGCCATCG 


ATCTGACTTG 


GGCGTCGCTC 


TTGTTCCTGG 


CTGCGACCCA 


GATCGACAAG 


420 


TTTGAGAATC 


CCTTGATCCG 


CTATTTGGCC 


TGGCCTGTTT 


ACTGGATCAT 


GCAGGGTATT 


480 


GTCTGCACCG 


GTGTCTGGGT 


GCTGGCTCAC 


GAGTGTGGTC 


ATCAGTCCTT 


CTCGACCTCC 


540 


AAGACCCTCA 


ACAACACAGT 


TGGTTGGATC 


TTGCACTCGA 


TGCTCTTGGT 


CCCCTACCAC 


600 


TCCTGGAGAA 


TCTCGCACTC 


GAAGCACCAC 


AAGGCCACTG 


GCCATATGAC 


CAAGGACCAG 


660 


GTCTTTGTGC 


CCAAGACCCG 


CTCCCAGGTT 


GGCTTGCCTC 


CCAAGGAGAA 


CGCTGCTGCT 


720 


GCCGTTCAGG 


AGGAGGACAT 


GTCCGTGCAC 


CTGGATGAGG 


AGGCTCCCAT 


TGTGACTTTG 


780 


TTCTGGATGG 


TGATCCAGTT 


CTTGTTCGGA 


TGGCCCGCGT 


ACCTGATTAT 


GAACGCCTCT 


840 


GGCCAAGACT 


ACGGCCGCTG 


GACCTCGCAC 


TTCCACACGT 


ACTCGCCCAT 


CTTTGAGCCC 


900 


CGCAACTTTT 


TCGACATTAT 


TATCTCGGAC 


CTCGGTGTGT 


TGGCTGCCCT 


CGGTGCCCTG 


960 


ATCTATGCCT 


CCATGCAGTT 


GTCGCTCTTG 


ACCGTCACCA 


AGTACTATAT 


TGTCCCCTAC 


1020 


CTCTTTGTCA 


ACTTTTGGTT 


GGTCCTGATC 


ACCTTCTTGC 


AGCACACCGA 


TCCCAAGCTG 


1080 


CCCCATTACC 


GCGAGGGTGC 


CTGGAATTTC 


CAGCGTGGAG 


CTCTTTGCAC 


CGTTGACCGC 


1140 


TCGTTTGGCA 


AGTTCTTGGA 


CCATATGTTC 


CACGGCATTG 


TCCACACCCA 


TGTGGCCCAT 


1200 


CACTTGTTCT 


CGCAAATGCC 


GTTCTACCAT 


GCTGAGGAAG 


CTACCTATCA 


TCTCAAGAAA 


1260 


CTGCTGGGAG 


AGTACTATGT 


GTACGACCCA 


TCCCCGATCG 


TCGTTGCGGT 


CTGGAGGTCG 


1320 


TTCCGTGAGT 


GCCGATTCGT 


GGAGGATCAG 


GGAGACGTGG 


TCTTTTTCAA 


GAAGTAAAAA 


1380 


AAAAGACAAT 


GGACCACACA 


CAACCTTGTC 


TCTACAGACC 


TACGTATCAT 


GTAGCCATAC 


1440 


CACTTCATAA 


AAGAACATGA 


GCTCTAGAGG 


CGTGTCATTC 


GCGCCTCC 




1488 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 4 ; 
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Met Ala 
1 

Ser Thr 



Gin Leu 



His Cys 
50 



Pro Pro Asn Thr He Asp Ala Gly Leu Thr Gin Arg His He 
5 10 15 

Ser Ala Pro Asn Ser Ala Lys Pro Ala Phe Glu Arg Asn Tyr 
20 25 30 

Pro Glu Phe Thr He Lys Glu He Arg Glu Cys He Pro Ala 
35 40 45 

Phe Glu Arg Ser Gly Leu Arg Gly Leu Cys His Val Ala He 
55 60 



Asp Leu Thr Trp Ala Ser Leu Leu Phe Leu Ala Ala Thr Gin He Asp 
65 7 0 75 80 

Lys Phe Glu Asn Pro Leu He Arg Tyr Leu Ala Trp Pro Val Tyr Trp 
85 90 95 

He Met Gin Gly He Val Cys Thr Gly Val Trp Val Leu Ala His Glu 
100 X05 HO 

Cys Gly His Gin Ser Phe Ser Thr Ser Lys Thr Leu Asn Asn Thr Val 
115 120 125 



Gly Trp 
130 

He Ser 
145 

Gin Val 

Glu Asn 

Asp Glu 

Leu Phe 
210 

Tyr Gly 
225 

Pro Arg 
Ala Leu 
Val Thr 



Val Leu 
290 

Arg Glu 
305 



He Leu His Ser Met Leu Leu Val Pro Tyr His Ser Trp Arg 
135 140 

His Ser. Lys His His Lys Ala Thr Gly His Met Thr Lys Asp 
150 155 i 60 

Phe Val Pro Lys Thr Arg Ser Gin Val Gly Leu Pro Pro Lys 
165 170 175 

Ala Ala Ala Ala Val Gin Glu Glu Asp Met Ser Val His Leu 
180 185 190 

Glu Ala Pro He Val Thr Leu Phe Trp Met Val He Gin Phe 
195 200 205 

Gly Trp Pro Ala Tyr Leu He Met Asn Ala Ser Gly Gin Asp 
215 220 

Arg Trp Thr Ser His Phe His Thr Tyr Ser Pro He Phe Glu 
230 235 240 

Asn Phe Phe Asp He He He Ser Asp Leu Gly Val Leu Ala 
245 250 255 

Gly Ala Leu He Tyr Ala Ser Met Gin Leu Ser Leu Leu Thr 
260 265 270 

Lys Tyr Tyr He Val Pro Tyr Leu Phe Val Asn Phe Trp Leu 
27 5 280 285 

He Thr Phe Leu Gin His Thr Asp Pro Lys Leu Pro His Tyr 
295 300 

Gly Ala Trp Asn Phe Gin Arg Gly Ala Leu Cys Thr Val Asp 
310 315 320 
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Arg Ser Phe Gly Lys Phe Leu Asp His Met Phe His Gly He Val His 
325 330 335 

Thr His Val Ala His His Leu Phe Ser Gin Met Pro Phe Tyr His Ala 
340 345 350 

Glu Glu Ala Thr Tyr His Leu Lys Lys Leu Leu Gly Glu Tyr Tyr Val 
355 360 365 

Tyr Asp Pro Ser Pro He Val Val Ala Val Trp Arg Ser Phe Arg Glu 
3*70 375 380 

Cys Arg Phe Val Glu Asp Gin Gly Asp Val Val Phe Phe Lys Lys 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 1483 base pairs 

20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



25 



30 



60 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCTTCCTCCA GTTCATCCTC CATTTCGCCA CCTGCATTCT TTACGACCGT TAAGCAAGAT 60 

GGGAACGGAC CAAGGAAAAA CCTTCACCTG GGAAGAGCTG GCGGCCCATA ACACCAAGGA 120 

35 CGACCTACTC TTGGCCATCC GCGGCAGGGT GTACGATGTC ACAAAGTTCT TGAGCCGCCA 180 

TCCTGGTGGA GTGGACACTC TCCTGCTCGG AGCTGGCCGA GATGTTACTC CGGTCTTTGA 240 

^ GATGTATCAC GCGTTTGGGG CTGCAGATGC CATTATGAAG AAGTACTATG TCGGTACACT 300 

GGTCTCGAAT GAGCTGCCCA TCTTCCCGGA GCCAACGGTG TTCCACAAAA CCATCAAGAC 3 60 

GAGAGTCGAG GGCTACTTTA CGGATCGGAA CATTGATCCC AAGAATAGAC CAGAGATCTG 420 

45 GGGACGATAC GCTCTTATCT TTGGATCCTT GATCGCTTCC TACTACGCGC AGCTCTTTGT 4 80 

GCCTTTCGTT GTCGAACGCA CATGGCTTCA GGTGGTGTTT GCAATCATCA TGGGATTTGC 54 0 

5Q GTGCGCACAA GTCGGACTCA ACCCTCTTCA TGATGCGTCT CACTTTTCAG TGACCCACAA 600 

CCCCACTGTC TGGAAGATTC TGGGAGCCAC GCACGACTTT TTCAACGGAG CATCGTACCT 660 

GGTGTGGATG TACCAACATA TGCTCGGCCA TCACCCCTAC ACCAACATTG CTGGAGCAGA 72 0 

55 TCCCGACGTG TCGACGTCTG AGCCCGATGT TCGTCGTATC AAGCCCAACC AAAAGTGGTT 780 

TGTCAACCAC ATCAACCAGC ACATGTTTGT TCCTTTCCTG TACGGACTGC TGGCGTTCAA 840 

GGTGCGCATT CAGGACATCA ACATTTTGTA CTTTGTCAAG ACCAATGACG CTATTCGTGT 900 

CAATCCCATC TCGACATGGC ACACTGTGAT GTTCTGGGGC GGCAAGGCTT TCTTTGTCTG 960 
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GTATCGCCTG ATTGTTCCCC TGCAGTATCT GCCCCTGGGC AAGGTGCTGC TCTTGTTCAC 1020 

GGTCGCGGAC ATGGTGTCGT CTTACTGGCT GGCGCTGACC TTCCAGGCGA ACCACGTTGT 1080 

TGAGGAAGTT CAGTGGCCGT TGCCTGACGA GAACGGGATC ATCCAAAAGG ACTGGGCAGC 1140 

TATGCAGGTC GAGACTACGC AGGATTACGC ACACGATTCG CACCTCTGGA CCAGCATCAC 12 00 

TGGCAGCTTG AACTACCAGG CTGTGCACCA TCTGTTCCCC AACGTGTCGC AGCACCATTA 12 60 

TCCCGATATT CTGGCCATCA TCAAGAACAC CTGCAGCGAG TACAAGGTTC CATACCTTGT 1320 

CAAGGATACG TTTTGGCAAG CATTTGCTTC ACATTTGGAG CACTTGCGTG TTCTTGGACT 1380 

15 CCGTCCCAAG GAAGAGTAGA AGAAAAAAAG CGCCGAATGA AGTATTGCCC CCTTTTTCTC 14 40 

CAAGAATGGC AAAAGGAGAT CAAGTGGACA TTCTCTATGA AGA 14 83 
< 2 > INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
25 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Gly Thr Asp Gin Gly Lys Thr Phe Thr Trp Glu Glu Leu Ala Ala 
1 5 10 15 

His Asn Thr Lys Asp Asp Leu Leu Leu Ala lie Arg Gly Arg Val Tyr 
20 25 30 

Asp Val Thr Lys Phe Leu Ser Arg His Pro Gly Gly Val Asp Thr Leu 
35 40 45 

Leu Leu Gly Ala Gly Arg Asp Val Thr Pro Val Phe Glu Met Tyr His 
50 55 60 

Ala Phe Gly Ala Ala Asp Ala lie Met Lys Lys Tyr Tyr Val Glv Thr 
65 70 75 80 

Leu Val Ser Asn Glu Leu Pro lie Phe Pro Glu Pro Thr Val Phe His 
85 go 95 

Lys Thr He Lys Thr Arg Val Glu Gly Tyr Phe Thr Asp Arg Asn He 
1Q 0 105 no 

Asp Pro Lys Asn Arg Pro Glu He Trp Gly Arg Tyr Ala Leu lie Phe 
115 120 125 

Gly Ser Leu lie Ala Ser Tyr Tyr Ala Gin Leu Phe Val Pro Phe Val 
130 135 140 

Val Glu Arg Thr Trp Leu Gin Val Val Phe Ala He He Met Gly Phe 
145 150 155 Y 16Q 
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Ala Cys Ala Gin Val Gly Leu Asn Pro Leu His Asp Ala Ser His Phe 
165 170 175 

Ser Val Thr His Asn Pro Thr Val Trp Lys lie Leu Gly Ala Thr His 
180 185 190 

Asp Phe Phe Asn Gly Ala Ser Tyr Leu Val Trp Met Tyr Gin His Met 
195 200 205 

Leu Gly His His Pro Tyr Thr Asn lie Ala Gly Ala Asp Pro Asp Val 
210 215 220 

Ser Thr Ser Glu Pro Asp Val Arg Arg lie Lys Pro Asn Gin Lys Trp 
225 230 235 240 

Phe Val Asn His lie Asn Gin His Met Phe Val Pro Phe Leu Tyr Gly 
245 250 255 

Leu Leu Ala Phe Lys Val Arg lie Gin Asp lie Asn lie Leu Tyr Phe 
260 265 270 

Val Lys Thr Asn Asp Ala lie Arg Val Asn Pro lie Ser Thr Trp His 
275 280 285 

Thr Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu 
290 295 300 

He Val Pro Leu Gin Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe 
305 310 315 320 

Thr Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gin 
325 330 335 

Ala Asn His Val Val Glu Glu Val Gin Trp Pro Leu Pro Asp Glu Asn 
340 345 350 

Gly He He Gin Lys Asp Trp Ala Ala Met Gin Val Glu Thr Thr Gin 
355 360 365 

Asp Tyr Ala His Asp Ser His Leu Trp Thr Ser He Thr Gly Ser Leu 
370 375 380 

Asn Tyr Gin Ala Val His His Leu Phe Pro Asn Val Ser Gin His His 
385 390 395 400 

Tyr Pro Asp He Leu Ala He He Lys Asn Thr Cys Ser Glu Tyr Lys 
405 410 415 

Val Pro Tyr Leu Val Lys Asp Thr Phe Trp Gin Ala Phe Ala Ser His 
420 425 430 

Leu Glu His Leu Arg Val Leu Gly Leu Arg Pro Lys Glu Glu 
435 440 445 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 7: 



Glu Val Arg Lys Leu Arg Thr Leu Phe Gin Ser Leu Gly Tyr Tyr Asp 
10 1 5 10 15 



Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val Ser Phe Asn Leu Cys He 
20 25 30 

Trp Gly Leu Ser Thr Val He Val Ala Lys Trp Gly Gin Thr Ser Thr 
35 40 45 

Leu Ala Asn Val Leu Ser Ala Ala Leu Leu Gly Leu Phe Trp Gin Gin 
50 55 60 

Cys Gly Trp Leu Ala His Asp Phe Leu His His Gin Val Phe Gin Asp 
65 70 75 80 

Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe Leu Gly Gly Val Cys Gin 
85 90 95 

Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys His Asn Thr His His Ala 
100 105 HO 

Ala Pro Asn Val His Gly Glu Asp Pro Asp He Asp Thr His Pro Leu 
115 120 125 

Leu Thr Trp Ser Glu His Ala Leu Glu Met Phe Ser Asp Val Pro Asp 
130 135 140 

Glu Glu Leu Thr Arg Met Trp Ser Arg Phe Met Val Leu Asn Gin Thr 
i45 150 155 160 

Trp Phe Tyr Phe Pro He Leu Ser Phe Ala Arg Leu Ser Trp Cys Leu 
165 170 175 

Gin Ser He Leu Phe Val Leu Pro Asn Gly Gin Ala His Lys Pro Ser 
180 185 190 

Gly Ala Arg Val Pro He Ser Leu Val Glu Gin Leu Ser Leu Ala Met 
195 200 205 

His Trp Thr Trp Tyr Leu Ala Thr Met Phe Leu Phe He Lys Asp Pro 
210 215 220 

Val Asn Met Leu Val Tyr Phe Leu Val Ser Gin Ala Val Cys Gly Asn 
225 230 235 240 

Leu Leu Ala He Val Phe Ser Leu Asn His Asn Gly Met Pro Val He 
245 250 255 

Ser Lys Glu Glu Ala Val Asp Met Asp Phe Phe Thr Lys Gin He He 
260 265 270 

Thr Gly Arg Asp Val His Pro Gly Leu Phe Ala Asn Trp Phe Thr Gly 
275 280 285 
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Gly Leu Asn Tyr Gin He Glu His His Leu Phe Pro Ser Met Pro Arg 
290 295 300 

His Asn Phe Ser Lys He Gin Pro Ala Val Glu Thr Leu Cys Lys Lys 
305 310 315 320 

Tyr Asn Val Arg Tyr His Thr Thr Gly Met He Glu Gly Thr Ala Glu 
325 330 335 

Val Phe Ser Arg Leu Asn Glu Val Ser Lys Ala Ala Ser Lys Met Gly 
340 345 350 

Lys Ala Gin 
355 

(2) INFORMATION FOR SEQ ID NO: 8: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Val Thr Leu Tyr 

1 

Leu Tyr Gly Val 
20 

Ala Gly Leu Leu 
35 

Asp Ser Gly His 
50 

Ala Gin Leu Leu 
65 

Lys Trp Thr His 



Gly Pro Asn Leu 
100 



Thr Leu Ala Phe 
5 

Leu Ala Cys Pro 



Gly Leu Leu Trp 
40 

Tyr Val He Met 
55 

Ser Gly Asn Cys 
70 

Asn Ala His His 
85 

Gin His He Pro 



Val Ala Ala Asn 
10 



Ser Val Xaa Pro 
25 



He Gin Ser Ala 



Ser Asn Lys Ser 
60 

Leu Thr Gly He 
75 

Leu Ala Cys Asn 
90 



Ser Leu Gly Val 
15 

His Gin He Ala 
30 

Tyr He Gly Xaa 
45 

Asn Asn Xaa Phe 



He Ala Trp Trp 
80 

Ser Leu Asp Tyr 
95 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Gly Val Leu Tyr Gly Val Leu Ala Cys Thr Ser Val Phe Ala His Gin 

15 10 15 

lie Ala Ala Ala Leu Leu Gly Leu Leu Trp lie Gin Ser Ala Tyr lie 

20 25 30 

Gly His Asp Ser Gly His Tyr Val lie Met Ser Asn Lys Ser Tyr Asn 
35 40 45 



1C Ar 9 Phe Ala Gin Leu Leu Ser Gly Asn Cys Leu Thr Gly He Ser He 

15 50 55 60 



Ala Trp Trp Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser 
65 70 75 80 

Leu Asp Tyr Asp Pro Asp Leu Gin His He Pro Val Phe Ala Val Ser 
85 90 95 

Thr Lys Phe Phe Ser Ser Leu Thr Ser Arg Phe Tyr Asp Arg Lys Leu 
100 105 no 

Thr Phe Gly Pro Val Ala Arg Phe Leu Val Ser Tyr Gin His Phe Thr 
H5 120 125 



~ n T y r Pro Vai Asn Cys Phe Gly Arg He Asn Leu Phe He Gin Thr 

JU 130 135 140 

Phe Leu Leu Leu Phe Ser Lys Arg Glu Val Pro Asp Arg Ala Leu Asn 
I 45 150 155 160 

3 ^ phe Aia G ly He Leu Val Phe Trp Thr Trp Phe Pro Leu Leu Val Ser 

165 170 175 

Cys Leu Pro Asn Trp Pro Glu Arg Phe Phe Phe Val Phe Thr Ser Phe 
4Q 180 185 190 

Thr Val Thr Ala Leu Gin His He Gin Phe Thr Leu Asn His Phe Ala 
195 200 205 

Aei Ala As P Vai T V r Va l Gly Pro Pro Thr Gly Ser Asp Trp Phe Glu Lys 

43 210 215 220 



Gin Ala Ala Gly Thr He Asp He Ser Cys Arg Ser Tyr Met Asp Trp 
225 230 235 240 

Phe Phe Gly Gly Leu Gin Phe Gin Leu Glu His His 
245 250 

(2) INFORMATION FOR SEQ ID NO: 10: 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS : not relevant 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Xaa Xaa Asn Phe Ala Gly He Leu Val Phe Trp Thr Trp Phe Pro 
15 10 15 

lrt Leu Leu Val Ser C V S L eu Pro Asn Trp Pro Glu Arg Phe Xaa Phe Val 

10 20 25 30 
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Phe Thr Gly Phe Thr Val Thr Ala Leu Gin His He Gin Phe Thr Leu 
35 40 45 

Asn His Phe Ala Ala Asp Val Tyr Val Gly Pro Pro Thr Gly Ser Asp 
50 55 60 

Trp Phe Glu Lys Gin Ala Ala Gly -Thr He Asp He Ser Cys Arg Ser 
65 70 75 80 

Tyr Met Asp Trp Phe Phe Cys Gly Leu Gin Phe Gin Leu Glu His His 
85 90 95 



Leu Phe Pro Arg Leu Pro Arg Cys His Leu Arg Lys Val Ser Pro Val 
Z:> 100 105 110 

Gly Gin Arg Gly Phe Gin Arg Lys Xaa Asn Leu Ser Xaa 
115 120 125 

30 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

35 <C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

40 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 

Pro Ala Thr Glu Val Gly Gly Leu Ala Trp Met He Thr Phe Tyr Val 

1 5 io 15 

Arg Phe Phe Leu Thr Tyr Val Pro Leu Leu Gly Leu Lys Ala Phe Leu 

20 25 30 

Gly Leu Phe Phe He Val Arg Phe Leu Glu Ser Asn Trp Phe Val Trp 
35 40 45 



« Val Thr Gln Met Asn His He Pro Met His He Asp His Asp Arg Asn 

33 50 55 60 

Met Asp Trp Val Ser Thr Gin Leu Gin Ala Thr Cys Asn Val His Lys 
65 70 75 80 



Ser Ala Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gin He Glu 
85 90 95 
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His His Leu Phe Pro Thr Met Pro Arg His Asn Tyr His Xaa Val Ala 
100 105 HO 

Pro Leu Val Gin Ser Leu Cys Ala Lys His Gly He Glu Tyr Gin Ser 
5 115 120 125 

Lys Pro Leu 
130 

10 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Cys Ser Pro Lys Ser Ser Pro Thr Arg Asn Met Thr Pro Ser Pro Phe 
15 10 15 

He Asp Trp Leu Trp Gly Gly Leu Asn Tyr Gin He Glu His His Leu 
20 25 30 

Phe Pro Thr Met Pro Arg Cys Asn Leu Asn Arg Cys Met Lys Tyr Val 
35 40 45 

„ L y s Glu T rp Cys Ala Glu Asn Asn Leu Pro Tyr Leu Val Asp Asp Tyr 

35 50 55 60 



Phe Val Gly Tyr Asn Leu Asn Leu Gin Gin Leu Lys Asn Met Ala Glu 

65 *70 75 80 

Leu Val Gin Ala Lys Ala Ala 
85 



(2) INFORMATION FOR SEQ ID NO: 13: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Arg His Glu Ala Ala Arg Gly Gly Thr Arg Leu Ala Tyr Met Leu Val 
15 10 15 

Cys Met Gin Trp Thr Asp Leu Leu Trp Ala Ala Ser Phe Tyr Ser Arg 
20 25 30 
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Phe Phe Leu Ser Tyr Ser Pro Phe Tyr Gly Ala Thr Gly Thr Leu Leu 
35 40 45 

Leu Phe Val Ala Val Arg Val Leu Glu Ser His Trp Phe Val Trp He 

50 55 60 

Thr Gin Met Asn His He Pro Lys Glu He Gly His Glu Lys His Arg 
65 70 75 80 

Asp Trp Ala Ser Ser Gin Leu Ala Ala Thr Cys Asn Val Glu Pro Ser 
85 90 95 

Leu Phe He Asp Trp Phe Ser Gly His Leu Asn Phe Gin He Glu His 
100 105 no 

His Leu Phe Pro Thr Met Thr Arg His Asn Tyr Arg Xaa Val Ala Pro 
13 -5 120 125 

Leu Val Lys Ala Phe Cys Ala Lys His Gly Leu His Tyr Glu Val 

130 135 140 

(2) INFORMATION FOR SEQ ID NO: 14: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Leu His His Thr Tyr Thr Asn He Ala Gly Ala Asp Pro Asp Val Ser 
15 10 15 

Thr Ser Glu Pro Asp Val Arg Arg He Lys Pro Asn Gin Lys Trp Phe 
20 25 30 

Val Asn His He Asn Gin His Met Phe Val Pro Phe Leu Tyr Gly Leu 
35 40 45 

Leu Ala Phe Lys Val Arg He Gin Asp He Asn He Leu Tyr Phe Val 
5 0 55 so 

Lys Thr Asn Asp Ala He Arg Val Asn Pro He Ser Thr Trp His Thr 
65 70 75 80 

Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu He 
85 90 95 

Val Pro Leu Gin Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe Thr 
1Q 0 105 no 

Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gin Ala 
H5 120 125 
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Asn Tyr Val Val Glu Glu Val Gin Trp Pro Leu Pro Asp Glu Asn Gly 
130 135 140 

He He Gin Lys Asp Trp Ala Ala Met Gin Val Glu Thr Thr Gin Asp 
145 150 155 160 

Tyr Ala His Asp Ser His Leu Trp Thr Ser He Thr Gly Ser Leu Asn 
165 170 175 

Tyr Gin Xaa Val His His Leu Phe Pro His 
180 185 

(2) INFORMATION FOR SEQ ID NO: 15: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS: not relevant 
<D) TOPOLOGY: linear 



10 



20 



25 



30 



35 



40 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

His Xaa Xaa His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



45 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ala Ala Gin He Lys Lys Tyr He Thr Ser Asp Glu Leu Lys Asn 

15 10 15 

His Asp Lys Pro Gly Asp Leu Trp He Ser He Gin Gly Lys Ala Tyr 
20 25 30 

Asp Val Ser Asp Trp Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu 

35 4 0 45 

Lys Ser Leu Ala Gly Gin Glu Val Thr Asp Ala Phe Val Ala Phe His 

50 55 60 



Pro Ala Ser Thr Trp Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tvr 

60 65 70 75 80 

Leu Lys Asp Tyr Ser Val Ser Glu Val Ser Lys Val Tyr Arg Lys Leu 
85 90 95 
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Val Phe Glu Phe Ser Lys Met Gly Leu Tyr Asp Lys Lys Gly His lie 
100 105 no 

Met Phe Ala Thr Leu Cys Phe He Ala Met Leu Phe Ala Met Ser Val 
115 120 125 

Tyr Gly Val Leu Phe Cys Glu Gly Val Leu Val His Leu Phe Ser Gly 
130 135 140 

Cys Leu Met Gly Phe Leu Trp He Gin Ser Gly Trp He Gly His Asp 
145 150 155 160 



Aia G ly His Tyr Met Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met 
15 165 170 175 



Gly He Phe Ala Ala Asn Cys Leu Ser Gly He Ser He Gly Trp Trp 
180 185 190 

Lys Trp Asn His Asn Ala His His He Ala Cys Asn Ser Leu Glu Tyr 
195 200 205 

Asp Pro Asp Leu Gin Tyr He Pro Phe Leu Val Val Ser Ser Lys Phe 
210 215 220 

Phe Gly Ser Leu Thr Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp 
225 230 235 240 

Ser Leu Ser Arg Phe Phe Val Ser Tyr Gin His Trp Thr Phe Tyr Pro 
245 250 255 

He Met Cys Ala Ala Arg Leu Asn Met Tyr Val Gin Ser Leu He Met 
260 265 270 

Leu Leu Thr Lys Arg Asn Val Ser Tyr Arg Ala Gin Glu Leu Leu Gly 
275 280 285 

Cys Leu Val Phe Ser He Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro 
290 295 300 

Asn Trp Gly Glu Arg He Met Phe Val He Ala Ser Leu Ser Val Thr 
305 310 315 320 

Gly Met Gin Gin Val Gin Phe Ser Leu Asn His Phe Ser Ser Ser Val 
325 330 335 

Tyr Val Gly Lys Pro Lys Gly Asn Asn Trp Phe Glu Lys Gin Thr Asp 
340 345 350 

Gly Thr Leu Asp He Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly 
355 360 365 

Gly Leu Gin Phe Gin He Glu His His Leu Phe Pro Lys Met Pro Arg 
370 375 380 

Cys Asn Leu Arg Lys He Ser Pro Tyr Val He Glu Leu Cys Lys Lys 
385 390 395 400 

His Asn Leu Pro Tyr Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met 
405 410 415 
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Thr Leu Arg Thr Leu Arg Asn Thr Ala Leu Gin Ala Arg Asp He Thr 
420 425 430 

Lys Pro Leu Pro Lys Asn Leu Val Trp Glu Ala Leu His Thr 
435 440 445 

(2) INFORMATION FOR SEQ ID NO: 17: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: peptide 



10 



(i) 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



25 



30 



40 



45 



50 



55 



60 



Met Leu Thr Ala Glu Arg He Lys Phe Thr Gin Lys Arg Gly Phe Arg 
15 10 15 

Arg Val Leu Asn Gin Arg Val Asp Ala Tyr Phe Ala Glu His Gly Leu 
20 25 30 

Thr Gin Arg Asp Asn Pro Ser Met Tyr Leu Lys Thr Leu He He Val 
35 40 45 

Leu Trp Leu Phe Ser Ala Trp Ala Phe Val Leu Phe Ala Pro Val He 
50 55 60 



~ c phe Pro Val Ar <? Leu Leu Gly Cys Met Val Leu Ala He Ala Leu Ala 

33 65 70 75 80 



Ala Phe Ser Phe Asn Val Gly His Asp Ala Asn His Asn Ala Tyr Ser 
85 90 95 

Ser Asn Pro His He Asn Arg Val Leu Gly Met Thr Tyr Asp Phe Val 
100 105 HO 

Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg His Asn Tyr Leu His His 
115 120 125 

Thr Tyr Thr Asn He Leu Gly His Asp Val Glu He His Gly Asp Glv 
130 135 140 

Ala Val Arg Met Ser Pro Glu Gin Glu His Val Gly He Tyr Arg Phe 
145 ISO 155 160 

Gin Gin Phe Tyr He Trp Gly Leu Tyr Leu Phe He Pro Phe Tyr Trp 
165 170 175 

Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn Lys Gly Lys Tyr His Asp 
180 185 190 

His Lys He Pro Pro Phe Gin Pro Leu Glu Leu Ala Ser Leu Leu Gly 
195 200 205 

He Lys Leu Leu Trp Leu Gly Tyr Val Phe Gly Leu Pro Leu Ala Leu 
210 215 220 
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Gly Phe Ser lie Pro Glu Val Leu lie Gly Ala Ser Val Thr Tyr Met 
225 230 235 240 

Thr Tyr Gly lie Val Val Cys Thr He Phe Met Leu Ala His Val Leu 
245 250 255 

Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly Glu Ser Gly Ala He Asp 
260 265 270 

Asp Glu Trp Ala He Cys Gin He Arg Thr Thr Ala Asn Phe Ala Thr 
275 280 285 



Asn Asn Pro Phe Trp Asn Trp Phe Cys Gly Gly Leu Asn His Gin Val 
15 290 295 300 



Thr His His Leu Phe Pro Asn He Cys His He His Tyr Pro Gin Leu 
305 310 315 320 

Glu Asn He He Lys Asp Val Cys Gin Glu Phe Gly Val Glu Tyr Lys 
325 330 335 

Val Tyr Pro Thr Phe Lys Ala Ala He Ala Ser Asn Tyr Arg Trp Leu 
340 345 350 

Glu Ala Met Gly Lys Ala Ser 
355 

(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: not relevant 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Thr Ser Thr Thr Ser Lys Val Thr Phe Gly Lys Ser He Gly Phe 
15 10 15 

Arg Lys Glu Leu Asn Arg Arg Val Asn Ala Tyr Leu Glu Ala Glu Asn 
20 25 30 

He Ser Pro Arg Asp Asn Pro Pro Met Tyr Leu Lys Thr Ala He He 
35 40 45 

Leu Ala Trp Val Val Ser Ala Trp Thr Phe Val Val Phe Gly Pro Asp 
50 55 60 

Val Leu Trp Met Lys Leu Leu Gly Cys He Val Leu Gly Phe Gly Val 
65 70 75 80 

Ser Ala Val Gly Phe Asn He Ser His Asp Gly Asn His Gly Gly Tyr 
85 90 95 
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Ser Lys Tyr Gin Trp Val Asn Tyr Leu Ser Gly Leu Thr His Asp Ala 
100 105 110 

He Gly Val Ser Ser Tyr Leu Trp Lys Phe Arg His Asn Val Leu His 
115 120 125 

His Thr Tyr Thr Asn He Leu Gly His Asp Val Glu He His Gly Asp 
130 135 140 

Glu Leu Val Arg Met Ser Pro Ser Met Glu Tyr Arg Trp Tyr His Arg 
145 150 155 160 

Tyr Gin His Trp Phe He Trp Phe Val Tyr Pro Phe He Pro Tyr Tyr 
165 170 175 

Trp Ser He Ala Asp Val Gin Thr Met Leu Phe Lys Arg Gin Tyr His 
180 185 190 

^ As P His Glu He Pro Ser Pro Thr Trp Val Asp He Ala Thr Leu Leu 

ZU 195 200 205 

Ala Phe Lys Ala Phe Gly Val Ala Val Phe Leu He He Pro He Ala 
210 215 220 

25 Val G1 y T y r Ser Pro Leu Glu Ala Val He Gly Ala Ser He Val Tyr 

225 230 235 240 

Met Thr His Gly Leu Val Ala Cys Val Val Phe Met Leu Ala His Val 
245 250 255 

He Glu Pro Ala Glu Phe Leu Asp Pro Asp Asn Leu His He Asp Asp 
260 265 270 

- c Glu Tr P Ala Iie Ala Gin Val Lys Thr Thr Val Asp Phe Ala Pro Asn 

275 280 285 

Asn Thr He He Asn Trp Tyr Val Gly Gly Leu Asn Tyr Gin Thr Val 
2 90 295 300 

His His Leu Phe Pro His He Cys His He His Tyr Pro Lys He Ala 
305 310 315 320 

Pro He Leu Ala Glu Val Cys Glu Glu Phe Gly Val Asn Tyr Ala Val 
325 330 335 

His Gin Thr Phe Phe Gly Ala Leu Ala Ala Asn Tyr Ser Trp Leu Lys 
340 345 350 



30 



40 



45 



cn L V S Met Ser He Asn Pro Glu Thr Lys Ala He Glu Gin 

50 355 360 365 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 



(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

5 CCAAGCTTCT GCAGGAGCTC TTTTTTTTTT TTTTT 35 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "Synthetic oligonucleotide" 



20 



(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /number= 1 
/note= "N=Inosine or Cytosine" 

25 (ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 27 

(D) OTHER INFORMATION: /number" 2 
/note= "N=Inosine or Cytosine" 



30 



35 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CUACUACUAC U AC AY CAY AC NTAYACNAAY AT 32 
(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 
45 (A) DESCRIPTION: /desc = "Synthetic oligonucleotide" 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 
50 (B) LOCATION: 13 

(D) OTHER INFORMATION: /number= 1 
/note 5 * "N=Inosine or Cytosine" 

(ix) FEATURE: 
55 (A) NAME /KEY : misc_f eature 

(B) LOCATION; 19 

(D) OTHER INFORMATION: /number^ 2 
/note« "N=Inosine or Cytosine" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
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CAUCAUCAUC AUNGGRAANA RRTGRTG 2 7 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CUACUACUAC UAGGAGTCCT CTACGGTGTT TTG 
20 (2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

35 CAUCAUCAUC AUATGATGCT CAAGCTGAAA CTG 

(2) INFORMATION FOR SEQ ID NO: 24: 

(ii SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Gin Xaa Xaa His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CUACUACUAC UACTCGAGCA AGATGGGAAC GGACCAAGG 39 
10 (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: other nucleic acid 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

25 CAUCAUCAUC AUCTCGAGCT ACTCTTCCTT GGGACGGAG 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

35 (ii) MOLECULE TYPE: other nucleic acid 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CUACUACUAC UATCTAGACT CGAGACCATG GCTGCTGCTC CAGTGTG 47 
(2) INFORMATION FOR SEQ ID NO: 28: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 ( D) TOPOLOGY : linear 



55 



(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



CAUCAUCAUC AUAGGCCTCG AGTTACTGCG CCTTACCCAT 4 0 

60 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 



CUACUACUA CUAGGATCCA TGGCACCTCC CAACACT 37 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
-(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

CAUCAUCAU CAUGGTACCT CGAGTTACTT CTTGAAAAAG AC 4 2 

(2) INFORMATION FOR SEQ ID NO: 31: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2692004) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



GCACGCCGAC 


CGGCGCCGGG 


AGATCCTGGC 


AAAGTATCCA 


GAGATAAAGT 


CCTTGATGAA 


60 


ACCTGATCCC 


AATTTGATAT 


GGATTATAAT 


TATGATGGTT 


CTCACCCAGT 


TGGGTGCATT 


120 


TTACATAGTA 


AAAGACTTGG 


ACTGGAAATG 


GGTCATATTT 


GGGGCCTATG 


CGTTTGGCAG 


180 


TTGCATTAAC 


CACTCAATGA 


CTCTGGCTAT 


TCATGAGATT 


GCCCACAATG 


CTGCCTTTGG 


240 


CAACTGCAAA 


GCAATGTGGA 


ATCGCTGGTT 


TGGAATGTTT 


GCTAATCTTC 


CTATTGGGAT 


300 


TCCATATTCA 


ATTTCCTTTA 


AGAGGTATCA 


CATGGATCAT 


CATCGGTACC 


TTGGAGCTGA 


360 


TGGCGTCGAT 


GTAGATATTC 


CTACCGATTT 


TGAGGGCTGG 


TTCTTCTGTA 


CCGCTTTCAG 


420 


AAAGTTTATA 


TGGGTTATTC 


TTCAGCCTCT 


CTTTTATGCC 


TTTCGACCTC 


TGTTCATCAA 


480 
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30 



CCCCAAACCA 


ATTACGTATC 


TGGAAGTTAT 


CAATACCGTG 


GCACAGGTCA 


CTTTTGACAT 


540 


TTTAATTTAT 


TACTTTTTGG 


GAATTAAATC 


CTTAGTCTAC 


ATGTTGGCAG 


CATCTTTACT 


600 


TGGCCTGGGT 


TTGCACCCAA 


TTTCTGGACA 


TTTTATAGCT 


GAGCATTACA 


TGTTCTTAAA 


660 


GGGTCATGAA 


ACTTACTCAT 


ATTATGGGCC 


TCTGAATTTA 


CTTACCTTCA 


ATGTGGGTTA 


720 


TCATAATGAA 


CATCATGATT 


TCCCCAACAT 


TCCTGGAAAA 


AGTCTTCCAC 


TGGTGAGGAA 


780 


AATAGCAGCT 


GAATACTATG 


ACAACCTCCC 


TCACTACAAT 


TCCTGGATAA 


AAGTACTGTA 


840 


TGATTTTGTG 


ATGGATGATA 


CAATAAGTCC 


CTACTCAAGA 


ATGAAGAGGC 


ACCAAAAAGG 


900 


AGAGATGGTG 


CTGGAGTAAA 


TATCATTAGT 


GCCAAAGGGA 




rirl/iO 111 AuA 


y du 


TGATAAAATG 


GAATTTTTGC 


ATTATTAAAC 


TTGAGACCAG 


TGATGCTCAG 


AAGCTCCCCT 


1020 


GGCACAATTT 


C AG AG T AAG A 


GCTCGGTGAT 


ACCAAGAAGT 


GAATCTGGCT 


TTTAAACAGT 


1080 


CAGCCTGACT 


CTGTACTGCT 


CAGTTTCACT 


CACAGGAAAC 


TTGTGACTTG 


TGTATTATCG 


1140 


TCATTGAGGA 


TGTTTCACTC 


ATGTCTGTCA 


TTTTATAAGC 


ATATCATTTA 


AAAAGCTTCT 


1200 


AAAAAGCTAT 


TTCGCCAGG 










1219 



(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2153526) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



45 


TTACCTTCTA 


CGTCCGCTTC 


TTCCTCACTT 


ATGTGCCACT 


ATTGGGGCTG 


AAAGCTTCCT 


60 




GGGCCTTTTC 


TTCATAGTCA 


GGTTCCTGGA 


AAGCAACTGG 


TTTGTGTGGG 


TG AC AC AG AT 


120 




GAACCATATT 


CCCATGCACA 


TTGATCATGA 


CCGGAACATG 


GACTGGGTTT 


CCACCCAGCT 


180 


50 


CCAGGCCACA 


TGCAATGTCC 


ACAAGTCTGC 


CTTCAATGAC 


TGGTTCAGTG 


GACACCTCAA 


240 




CTTCCAGATT 


GAGCACCATC 


TTTTTCCCAC 


GATGCCTCGA 


CACAATTACC 


ACAAAGTGGC 


300 


55 


TCCCCTGGTG 


CAGTCCTTGT 


GTGCCAAGCA 


TGGCATAGAG 


TACCAGTCCA 


AGCCCCTGCT 


360 




GTCAGCCTTC 


GCCGACATCA 


TCCACTCACT 


AAAGGAGTCA 


GGGCAGCTCT 


GGCTAGATGC 


420 




CTATCTTCAC 


CAATAACAAC 


AGCCACCCTG 


CCCAGTCTGG 


AAGAAGAGGA 


GGAAGACTCT 


480 


60 


GGAGCCAAGG 


CAGAGGGGAG 


CTTGAGGGAC 


AATGCCACTA 


TAGTTTAATA 


CTCAGAGGGG 


540 




GTTGGGTTTG 


GGGACATAAA 


GCCTCTGACT 


CAAACTCCTC 


CCTTTTATCT 


TCTAGCCACA 


600 
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GTTCTAAGAC CCAAAGTGGG GGGTGGACAC AGAAGTCCCT AGGAGGGAAG GAGCT 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3506132) 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 
TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 
CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 
AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 
CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 
AAGA 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3854933) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



CAGGGACCTA 


CCCCGCGCTA 


CTTCACCTGG 


GACGAGGTGG 


CCCAGCGCTC 


AGGGTGCGAG 


GAGCGGTGGC 


TAGTGATCGA 


CCGTAAGGTG 


TACAACATCA 


GCGAGTTCAC 


CCGCCGGCAT 


CCAGGGGGCT 


CCCGGGTCAT 


CAGCCACTAC 


GCCGGGCAGG 


ATGCCACGGA 


TCCCTTTGTG 


GCCTTCCACA 


TCAACAAGGG 


CCTTGTGAAG 


AAGTATATGA 


ACTCTCTCCT 


GATTGGAGAA 


CTGTCTCCAG 


AGCAGCCCAG 


CTTTGAGCCC 


ACCAAGAATA 


AAGAGCTGAC 


AGATGAGTTC 


CGGGAGCTGC 


GGGCCACAGT 


GGAGCGGATG 


GGGCTCATGA 


AGGCCAACCA 


TGTCTTCTTC 


CTGCTGTACC 


TGCTGCACAT 


CTTGCTGCTG 


GATGGTGCAG 


CCTGGCTCAC 


CCTTTGGGTC 


TTTGGGACGT 


CCTTTTTGCC 


CTTCCTCCTC 


TGTGCGGTGC 


TGCTCAGTGC 


AGTTCAGGCC 


CAGGCTGGCT 


GGCTGCAGCA 


TGACTTTGGG 


CACCTGTCGG 


TCTTCAGCAC 


CTCAAAGTGG 


AACCATCTGC 


TACATCATTT 


TGTGATTGGC 


CACCTGAAGG 


GGGCCCCCGC 


CAGTTGGTGG 


AACCACATGC 


ACTTCCAGCA 


CCATGCCAAG 


CCCAACTGCT 


TCCGCAAAGA 


CCCAGACATC 
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AACATGCATC CCTTCTTCTT TGCCTTGGGG AAGATCCTCT CTGTGGAGCT TGGGAAACAG 720 

AAGAAAAAAT ATATGCCGTA CAACCACCAG CACARATACT TCTTCCTAAT TGGGCCCCCA 7 80 

GCCTTGCTGC CTCTCTACTT CCAGTGGTAT ATTTTCTATT TTGTTATCCA GCGAAAGAAG 8 40 

TGGGTGGACT TGGCCTGGAT CAGCAAACAG GAATACGATG AAGCCGGGCT TCCATTGTCC 900 
10 ACCGCAAATG CTTCTAAA 



(2) INFORMATION FOR SEQ ID NO: 35: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



25 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2511785) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



918 





GCCACTTAAA 


GGGTGCCTCT 


GCCAACTGGT 


GGAATCATCG 


CCACTTCCAG 


CACCACGCCA 


60 




AGCCTAACAT 


CTTCCACAAG 


GATCCCGATG 


TGAACATGCT 


GCACGTGTTT 


GTTCTGGGCG 


120 


30 


AATGGCAGCC 


CATCGAGTAC 


GGCAAGAAGA 


AGCTGAAATA 


CCTGCCCTAC 


AATCACCAGC 


180 




ACGAATACTT 


CTTCCTGATT 


GGGCCGCCGC 


TGCTCATCCC 


CATGTATTTC 


CAGTACCAGA 


240 


35 


TCATCATGAC 


CATGATCGTC 


CATAAGAACT 


GGGTGGACCT 


GGCCTGGGCC 


GTCAGCTACT 


300 




ACATCCGGTT 


CTTCATCACC 


TACATCCCTT 


TCTACGGCAT 


CCTGGGAGCC 


CTCCTTTTCC 


360 




TCAACTTCAT 


CAGGTTCCTG 


GAGAGCCACT 


GGTTTGTGTG 


GGTCACACAG 


ATGAATCACA 


420 


40 


TCGTCATGGA 


GATTGACCAG 


GAGGCCTACC 


GTGACTGGTT 


CAGTAGCCAG 


CTGACAGCCA 


480 




CCTGCAACGT 


GGAGCAGTCC 


TTCTTCAACG 


ACTGGTTCAG 


TGGACACCTT 


AACTTCCAGA 


540 


45 


TTGAGCACCA 


CCTCTTCCCC 


ACCATGCCCC 


GGCACAACTT 


ACACAAGATC 


GCCCCGCTGG 


600 




TGAAGTCTCT 


ATGTGCCAAG 


CATGGCATTG 


AATACCAGGA 


GAAGCCGCTA 


CTGAGGGCCC 


660 




TGCTGGACAT 


CATCAGGTCC 


CTGAAGAAGT 


CTGGGAAGCT 


GTGGCTGGAC 


GCCTACCTTC 


720 


50 


ACAAATGAAG 


CCACAGCCCC 


CGGGACACCG 


TGGGGAAGGG 


GTGCAGGTGG 


GGTGATGGCC 


780 




AGAGGAATGA 


TGGGCTTTTG 


TTCTGAGGGG 


TGTCCGAGAG 


GCTGGTGTAT 


GCACTGCTCA 


840 


55 


CGGACCCCAT 


GTTGGATCTT 


TCTCCCTTTC 


TCCTCTCCTT 


TTTCTCTTCA 


CATCTCCCCC 


900 




ATAGCACCCT 


GCCCTCATGG 


GACCTGCCCT 


CCCTCAGCCG 


TCAGCCATCA 


GCCATGGCCC 


960 




TCCCAGTGCC 


TCCTAGCCCC 


TTCTTCCAAG 


GAGCAGAGAG 


GTGGCCACCG 


GGGGTGGCTC 


1020 


60 


TGTCCTACCT 


CCACTCTCTG 


CCCCTAAAGA 


TGGGAGGAGA 


CCAGCGGTCC 


ATGGGTCTGG 


1080 




CCTGTGAGTC 


TCCCCTTGCA 


GCCTGGTCAC 


TAGGCATCAC 


CCCCGCTTTG 


GTTCTTCAGA 


1140 
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TGCTCTTGGG GTTCATAGGG GCAGGTCCTA GTCGGGCAGG GCCCCTGACC CTCCCGGCCT 
GGCTTCACTC TCCCTGACGG CTGCCATTGG TCCACCCTTT CATAGAGAGG CCTGCTTTGT 
TACAAAGCTC GGGTCTCCCT CCTGCAGCTC GGTTAAGTAC CCGAGGCCTC TCTTAAGATG 
TCCAGGGCCC CAGGCCCGCG GGCACAGCCA GCCCAAACCT TGGGCGCTGG AAGAGTCCTC 
CACCCCATCA CTAGAGTGCT CTGACCCTGG GCTTTCACGG GCCCCATTCC ACCGCCTCCC 
CAACTTGAGC CTGTGACCTT GGGACCAAAG GGGGAGTCCC TCGTCTCTTG TGACTCAGCA 
GAGGCAGTGG CCACGTTCAG GGAGGGGCCG GCTGGCCTGG AGGCTCAGCC CACCCTCCAG 
CTTTTCCTCA GGGTGTCCTG AGGTCCAAGA TTCTGGAGCA ATCTGACCCT TCTCCAAAGG 
CTCTGTTATC AGCTGGGCAG TGCCAGCCAA TCCCTGGCCA TTTGGCCCCA GGGGACGTGG 
GCCCTG 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid (Contig 2535) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 



GTCTTTTACT 


TTGGCAATGG 


CTGGATTCCT 


ACCCTCATCA 


CGGCCTTTGT 


CCTTGCTACC 


TCTCAGGCCC 


AAGCTGGATG 


GCTGCAACAT 


GATTATGGCC 


ACCTGTCTGT 


CTACAGAAAA 


CCCAAGTGGA 


ACCACCTTGT 


CCACAAATTC 


GTCATTGGCC 


ACTTAAAGGG 


TGCCTCTGCC 


AACTGGTGGA 


ATCATCGCCA 


CTTCCAGCAC 


CACGCCAAGC 


CTAACATCTT 


CCACAAGGAT 


CCCGATGTGA 


ACATGCTGCA 


CGTGTTTGTT 


CTGGGCGAAT 


GGCAGCCCAT 


CGAGTACGGC 


AAGAAGAAGC 


TGAAATACCT 


GCCCTACAAT 


CACCAGCACG 


AATACTTCTT 


CCTGATTGGG 


CCGCCGCTGC 


TCATCCCCAT 


GTATTTCCAG 


TACCAGATCA 


TCATGACCAT 


GATCGTCCAT 


AAGAACTGGG 


TGGACCTGGC 


CTGGGCCGTC 


AGCTACTACA 


TCCGGTTCTT 


CATCACCTAC 


ATCCCTTTCT 


ACGGCATCCT 


GGGAGCCCTC 


CTTTTCCTCA 


ACTTCATCAG 


GTTCCTGGAG 


AGCCACTGGT 


TTGTGTGGGT 


CACACAGATG 


AATCACATCG 


TCATGGAGAT 


TGACCAGGAG 


GCCTACCGTG 


ACTGGTTCAG 


TAGCCAGCTG 


ACAGCCACCT 


GCAACGTGGA 


GCAGTCCTTC 


TTCAACGACT 


GGTTCAGTGG 


ACACCTTAAC 


TTCCAGATTG 


AGCACCACCT 


CTTCCCCACC 


ATGCCCCGGC 


ACAACTTACA 


CAAGATCGCC 


CCGCTGGTGA 


AGTCTCTATG 


TGCCAAGCAT 


GGCATTGAAT 


ACCAGGAGAA 


GCCGCTACTG 


AGGGCCCTGC 


TGGACATCAT 


CAGGTCCCTG 
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AAGAAGTCTG GGAAGCTGTG GCTGGACGCC TACCTTCACA AATGAAGCCA CAGCCCCCGG 900 

GACACCGTGG GGAAGGGGTG CAGGTGGGGT GATGGCCAGA GGAATGATGG GCTTTTGTTC 960 

TGAGGGGTGT CCGAGAGGCT GGTGTATGCA CTGCTCACGG ACCCCATGTT GGATCTTTCT 1020 

CCCTTTCTCC TCTCCTTTTT CTCTTCACAT CTCCCCCATA GCACCCTGCC CTCATGGGAC 1080 

10 CTGCCCTCCC TCAGCCGTCA GCCATCAGCC ATGGCCCTCC CAGTGCCTCC TAGCCCCTTC 114 0 

TTCCAAGGAG CAGAGAGGTG GCCACCGGGG GTGGCTCTGT CCTACCTCCA CTCTCTGCCC 1200 

CTAAAGATGG GAGGAGACCA GCGGTCCATG GGTCTGGCCT GTGAGTCTCC CCTTGCAGCC 12 60 

TGGTCACTAG GCATCACCCC CGCTTTGGTT CTTCAGATGC TCTTGGGGTT CATAGGGGCA 1320 

GGTCCTAGTC GGGCAGGGCC CCTGACCCTC CCGGCCTGGC TTCACTCTCC CTGACGGCTG 1380 

20 CCATTGGTCC ACCCTTTCAT AGAGAGGCCT GCTTTGTTAC AAAGCTCGGG TCTCCCTCCT 14 40 

GCAGCTCGGT TAAGTACCCG AGGCCTCTCT TAAGATGTCC AGGGCCCCAG GCCCGCGGGC 1500 

^ ACAGCCAGCC CAAACCTTGG GCCCTGGAAG AGTCCTCCAC CCCATCACTA GAGTGCTCTG 1560 

ACCCTGGGCT TTCACGGGCC CCATTCCACC GCCTCCCCAA CTTGAGCCTG TGACCTTGGG 1620 

ACCAAAGGGG GAGTCCCTCG TCTCTTGTGA CTCAGCAGAG GCAGTGGCCA CGTTCAGGGA 1680 

30 GGGGCCGGCT GGCCTGGAGG CTCAGCCCAC CCTCCAGCTT TTCCTCAGGG TGTCCTGAGG 1740 

TCCAAGATTC TGGAGCAATC TGACCCTTCT CCAAAGGCTC TGTTATCAGC TGGGCAGTGC 1800 

CAGCCAATCC CTGGCCATTT GGCCCCAGGG GACGTGGGCC CTG 1843 



35 



60 



(2) INFORMATION FOR SEQ ID NO: 37: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: other nucleic acid (Edited Contig 253538a) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

^ CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG 60 

GAGCGGTGGC TAGTGATCGA CCGTAAGGTG TACAACATCA GCGAGTTCAC CCGCCGGCAT 120 

CCAGGGGGCT CCCGGGTCAT CAGCCACTAC GCCGGGCAGG ATGCCACGGA TCCCTTTGTG 180 

55 GCCTTCCACA TCAACAAGGG CCTTGTGAAG AAGTATATGA ACTCTCTCCT GATTGGAGAA 2 40 

CTGTCTCCAG AGCAGCCCAG CTTTGAGCCC ACCAAGAATA AAGAGCTGAC AGATGAGTTC 300 

CGGGAGCTGC GGGCCACAGT GGAGCGGATG GGGCTCATGA AGGCCAACCA TGTCTTCTTC 360 

CTGCTGTACC TGCTGCACAT CTTGCTGCTG GATGGTGCAG CCTGGCTCAC CCTTTGGGTC 420 
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TTTGGGACGT 


CCTTTTTGCC 


CTTCCTCCTC 


TGTGCGGTGC 


TGCTCAGTGC 


AGTTCAGCAG 


480 




GCCCAAGCTG 


GATGGCTGCA 


ACATGATTAT 


GGCCACCTGT 


CTGTCTACAG 


AAAACCCAAG 


540 


5 


TGGAACCACC 


TTGTCCACAA 


ATTCGTCATT 


GGCCACTTAA 


AGGGTGCCTC 


TGCCAACTGG 


600 




TGGAATCATC 


GCCACTTCCA 


GCACCACGCC 


AAGCCTAACA 


TCTTCCACAA 


GGATCCCGAT 


660 


10 


GTGAACATGC 


TGCACGTGTT 


TGTTCTGGGC 


GAATGGCAGC 


CCATCGAGTA 


CGGCAAGAAG 


720 


AAGCTGAAAT 


ACCTGCCCTA 


CAATCACCAG 


CACGAATACT 


TCTTCCTGAT 


TGGGCCGCCG 


780 




CTGCTCATCC 


CCATGTATTT 


CCAGTACCAG 


ATCATCATGA 


CCATGATCGT 


CCATAAGAAC 


840 


15 


TGGGTGGACC 


TGGCCTGGGC 


CGTCAGCTAC 


TACATCCGGT 


TCTTCATCAC 


CTACATCCCT 


900 




TTCTACGGCA 


TCCTGGGAGC 


CCTCCTTTTC 


CTCAACTTCA 


TCAGGTTCCT 


GGAGAGCCAC 


960 


20 


TGGTTTGTGT 


GGGTCACACA 


GATGAATCAC 


ATCGTCATGG 


AGATTGACCA 


GGAGGCCTAC 


1020 


CGTGACTGGT 


TCAGTAGCCA 


GCTGACAGCC 


ACCTGCAACG 


TGGAGCAGTC 


CTTCTTCAAC 


1080 




GACTGGTTCA 


GTGGACACCT 


TAACTTCCAG 


ATTGAGCACC 


ACCTCTTCCC 


CACCATGCCC 


1140 


25 


CGGCACAACT 


TACACAAGAT 


CGCCCCGCTG 


GTGAAGTCTC 


TATGTGCCAA 


GC AT GGCATT 


1200 




GAATACCAGG 


AGAAGCCGCT 


ACTGAGGGCC 


CTGCTGGACA 


TCATCAGGTC 


CCTGAAGAAG 


1260 


30 


TCTGGGAAGC 


TGTGGCTGGA 


CGCCTACCTT 


CACAAATGAA 


GCCACAGCCC 


CCGGGACACC 


1320 


GTGGGGAAGG 


GGTGCAGGTG 


GGGTGATGGC 


CAGAGGAATG 


ATGGGCTTTT 


GTTCTGAGGG 


1380 




GTGTCCGAGA 


GGCTGGTGTA 


TGCACTGCTC 


ACGGACCCCA 


TGTTGGATCT 


TTCTCCCTTT 


1440 


35 


CTCCTCTCCT 


TTTTCTCTTC 


ACATCTCCCC 


CATAGCACCC 


TGCCCTCATG 


GGACCTGCCC 


1500 




TCCCTCAGCC 


GTCAGCCATC 


AGCCATGGCC 


CTCCCAGTGC 


CTCCTAGCCC 


CTTCTTCCAA 


1560 


40 


GGAGCAGAGA 


GGTGGCCACC 


GGGGGTGGCT 


CTGTCCTACC 


TCCACTCTCT 


GCCCCTAAAG 


1620 




ATGGGAGGAG 


ACCAGCGGTC 


CATGGGTCTG 


GCCTGTGAGT 


CTCCCCTTGC 


AGCCTGGTCA 


1680 




CTAGGCATCA 


CCCCCGCTTT 


GGTTCTTCAG 


ATGCTCTTGG 


GGTTCATAGG 


GGCAGGTCCT 


1740 


45 


AGTCGGGCAG 


GGCCCCTGAC 


CCTCCCGGCC 


TGGCTTCACT 


CTCCCTGACG 


GCTGCCATTG 


1800 




GTCCACCCTT 


TCATAGAGAG 


GCCTGCTTTG 


TTACAAAGCT 


CGGGTCTCCC 


TCCTGCAGCT 


1860 


50 


CGGTTAAGTA 


CCCGAGGCCT 


CTCTTAAGAT 


GTCCAGGGCC 


CCAGGCCCGC 


GGGCACAGCC 


1920 




AGCCCAAACC 


TTGGGCCCTG 


GAAGAGTCCT 


CCACCCCATC 


ACTAGAGTGC 


TCTGACCCTG 


1980 




GGCTTTCACG 


GGCCCCATTC 


CACCGCCTCC 


CCAACTTGAG 


CCTGTGACCT 


TGGGACCAAA 


2040 


55 


GGGGGAGTCC 


CTCGTCTCTT 


GTGACTCAGC 


AGAGGCAGTG 


GCCACGTTCA 


GGGAGGGGCC 


2100 




GGCTGGCCTG 


GAGGCTCAGC 


CCACCCTCCA 


GCTTTTCCTC 


AGGGTGTCCT 


GAGGTCCAAG 


2160 


60 


ATTCTGGAGC 


AATCTGACCC 


TTCTCCAAAG 


GCTCTGTTAT 


CAGCTGGGCA 


GTGCCAGCCA 


2220 




ATCCCTGGCC 


ATTTGGCCCC 


AGGGGACGTG 


GGCCCTG 






2257 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2 692004) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



His 


Ala 


Asp 


Arg 


Arg 


Arg 


Ul u 


X Xc 


Leu 


nia 


Lys 


Tyr 


Pro 


Glu 


He 


1 








5 










1 Pi 










15 


Lvs 


Ser 


Leu 


Met 


Lys 
20 


Pro 


nSp 


tr 4. \j 


As n 


Leu 


lie 


Trp 


He 


lie 


He 

JU 


Met 


Met 


Val 


Leu 


Thr 


VJlU 


T 

Leu 


uiy 


n i a 

rilo 


pne 


Tyr 


I le 


Val 


Lys 


Asp 










35 










40 








45 


Leu 


Asp 


Trp 


Lvs 
j 


Tro 


Val 


He 


Phe 


Gly 


Ala 


Tyr 


a 1 a 
nia 


Phe 


Gly 


Ser 










50 


















bU 


Cys 


lie 


Asn 


His 


Ser 
65 


Met 


Thr 


Leu 


ni ct 


Tip 
X Xc 

70 


nis 


ulU 


He 


Ala 


His 
75 


Asn 


Ala 


Ala 


Phe 


Glv 


Asn 


Cys 


Lys 


Ala 
nxa 




Trp 


Asn 


Arg 


Trp 


Pne 










80 










85 










Gly 


Met 


Phe 


Ala 


Asn 
95 


Leu 


Pro 


He 


Gly 


lie 

100 


Pro 


Tyr 


Ser 


lie 


Ser 
105 


Phe 


Lys 


Arq 


Tvr 


His 


Met 


Asp 


His 


His 


Arg 


Tyr 


Leu 


Gly Ala 


Asp 










110 










115 










120 


Gly 


Val 


Asp 


Val 


Asp 


He 


Pro 


Thr 




rne 


uiu 


Vjiy 


Trp 


Phe 


O V-> /-v 

Pne 










125 










130 








± JO 


Cys 


Thr 


Ala 


Phe 


Aro 
140 


Lvs 


Phe 


lie 


Trn 


Val 
145 


Tl/s 
X Xt5 


Leu 


Gin 


Pro 


Leu 

13U 


Phe 


Tyr 


Ala 


Phe 


Aro 


Pro 


Leu 


Phe 


He 


Asn 


Prn 


Lys 


Pro 


He 


i nr 










155 










160 








165 


Tyr 


Leu 


Glu 


Val 


He 


Asn 


Thr 


Val 


Ala 


oxn 


v a x 




Phe 


Asp 


lie 










170 










175 








1 PA 


Leu 


He 


Tyr 


Tyr 


Phe 


Leu 


Gly 


He 


Lys 


Ser 


Leu 


Val 


Tyr 


Met 


Leu 










185 










190 








195 


Ala 


Ala 


Ser 


Leu 


Leu 


Gly 


Leu 


Gly 


Leu 


His 


Pro 


He 


Ser 


Gly 


His 










200 










205 








210 


Phe 


He 


Ala 


Glu 


His 
215 


Tyr 


Met 


Phe 


Leu 


Lys 
220 


Gly 


His 


Glu 


Thr 


Tyr 
225 


Ser 


Tyr 


Tyr 


Gly 


Pro 


Leu 


Asn 


Leu 


Leu 


Thr 


Phe 


Asn 


Val 


Gly 


Tyr 


His 








230 










235 








240 


Asn 


Glu 


His 


His 
245 


Asp 


Phe 


Pro 


Asn 


He 
250 


Pro 


Gly 


Lys 


Ser 


Leu 
255 


Pro 


Leu 


Val 


Arg 


Lys 


He 


Ala 


Ala 


Glu 


Tyr 


Tyr 


Asp 


Asn 


Leu 


Pro 


His 








260 










265 










270 


Tyr 


Asn 


Ser 


Trp 
275 


He 


Lys 


Val 


Leu 


Tyr 
280 


Asp 


Phe 


Val 


Met 


Asp 
285 


Asp 


Thr 


He 


Ser 


Pro 
290 


Tyr 


Ser 


Arg 


Met 


Lys 
295 


Arg 


His 


Gin 


Lys 


Gly 
300 


Glu 


Met 


Val 


Leu 


Glu 


+ * * 


He 


Ser 


Leu 


Val 


Pro 


Lys 


Gly 


Phe 


Phe 


Ser 








305 










310 








315 


Lys 


Thr 


Leu 


Asp 


Asp 


Lys 


Met 


Glu 


Phe 


Leu 


His 


Tyr 


* * * 


Thr 


* * ★ 








320 










325 








330 


Asp 


Gin 


* + ★ 


Cys 
335 


Ser 


Glu 


Ala 


Pro 


Leu 
340 


Ala 


Gin 


Phe 


Gin 


Ser 
345 


Lys 


Ser 


Ser 


Val 


He 


Pro 


Arg 


Ser 


Glu 


Ser 


Gly 


Phe 


* * * 


Thr 


Val 










350 










355 








360 
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Ser Leu Thr Leu Tyr Cys Ser Val Ser Leu Thr Gly Asn Leu *** 

365 370 375 

Leu Val Tyr Tyr Arg His *** Gly Cys Phe Thr His Val Cys His 

380 385 390 

Phe lie Ser lie Ser Phe Lys Lys Leu Leu Lys Ser Tyr Phe Ala 

400 405 410 

Arg 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: amino acid (Translation of Contig 2153526) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



Tyr 


Leu 


Leu 


Arg 


Pro 


Leu 


Leu 


Pro 


His 


Leu 


Cys 


Ala 


Thr 


He 


Gly 


1 








5 










10 










15 


Ala 


Glu 


Ser 


Phe 


Leu 


Gly 


Leu 


Phe 


Phe 


He 


Val 


Arg 


Phe 


Leu 


Glu 










20 










25 








30 


Ser 


Asn 


Trp 


Phe 


Val 


Trp 


Val 


Thr 


Gin 


Met 


Asn 


His 


He 


Pro 


Met 










35 










40 










45 


His 


He 


Asp 


His 


Asp 


Arg 


Asn 


Met 


Asp 


Trp 


Val 


Ser 


Thr 


Gin 


Leu 










50 










55 










60 


Gin 


Ala 


Thr 


Cys 


Asn 


Val 


His 


Lys 


Ser 


Ala 


Phe 


Asn 


Asp 


Trp 


Phe 










65 










70 






75 


Ser 


Gly 


His 


Leu 


Asn 


Phe 


Gin 


He 


Glu 


His 


His 


Leu 


Phe 


Pro 


Thr 


Met 








80 










85 










90 


Pro 


Arg 


His 


Asn 


Tyr 


His 


Lys 


Val 


Ala 


Pro 


Leu 


Val 


Gin 


Ser 


Leu 








95 










100 










105 


Cys 


Ala 


Lys 


His 


Gly 


He 


Glu 


Tyr 


Gin 


Ser 


Lys 


Pro 


Leu 


Leu 


Ser 


Ala 






110 










115 










120 


Phe 


Ala 


Asp 


He 


He 


His 


Ser 


Leu 


Lys 


Glu 


Ser 


Gly 


Gin 










125 










130 








135 


Leu 


Trp 


Leu 


Asp 


Ala 


Tyr 


Leu 


His 


Gin 


+ + + 


Gin 


Gin 


Pro 


Pro 


Cys 


Pro 








140 










145 










150 


Val 


Trp 


Lys 


Lys 


Arg 


Arg 


Lys 


Thr 


Leu 


Glu 


Pro 


Arg 


Gin 


Arg 


Gly 








155 










160 








165 


Ala 


* * * 


Gly 


Thr 


Met 


Pro 


Leu 


* * * 


Phe 


Asn 


Thr 


Gin 


Arg 


Gly 










170 










175 








180 


Leu 


Gly 


Leu 


Gly 


Thr 


★ * * 


Ser 


Leu 


* * * 


Leu 


Lys 


Leu 


Leu 


Pro 


Phe 


He 








185 










190 










195 


Phe 


* * * 


Pro 


Gin 


Phe 


* * + 


Asp 


Pro 


Lys 


Trp 


Gly 


Val 


Asp 


Thr 










200 










205 








210 


Glu 


Val 


Pro 


Arg 


Arg 


Glu 


Gly 


Ala 















215 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: amino acid (Translation of Contig 3506132) 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

5 



Val 


Phe 


Tyr 


Phe 


Gly 


Asn 


Gly 


Trp 


He 


Pro 


Thr 


Leu 


He 


Thr 


Ala 


1 








5 










10 










15 


Phe 


Val 


Leu 


Ala 


Thr 


Ser 


Gin 


Ala 


Gin 


Ala 


Gly 


Trp 


Leu 


Gin 


His 










20 










25 










30 


Asp 


Tyr 


Gly 


His 


Leu 


Ser 


Val 


Tyr 


Arg 


Lys 


Pro 


Lys 


Trp 


Asn 


His 










35 










40 










45 


Leu 


Val 


His 


Lys 


Phe 


Val 


He 


Gly 


His 


Leu 


Lys 


Gly 


Ala 


Ser 


Ala 










50 










55 










60 


Asn 


Trp 


Trp 


Asn 


His 


Arg 


His 


Phe 


Gin 


His 


His 


Ala 


Lys 


Pro 


Asn 










65 










70 








75 


Leu 


Gly 


Glu 


Trp 


Gin 


Pro 


He 


Glu 


Tyr 


Gly 


Lys 


Xxx 









80 85 

20 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 306 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: amino acid (Translation of Contig 3854933) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



35 


Gin 
1 


Gly 


Pro 


Thr 


Pro 
5 


Arg 


Tyr 


Phe 


Thr 


Trp 
10 


Asp 


Glu 


Val 


Ala 


Gin 
15 




Arg 


Ser 


Gly 


Cys 


Glu 


Glu 


Arg 


Trp 


Leu 


Val 


He 


Asp 


Arg 


Lys 


Val 












20 










25 






30 


40 


Tyr 


Asn 


He 


Ser 


Glu 
35 


Phe 


Thr 


Arg 


Arg 


His 
40 


Pro 


Gly 


Gly 


Ser 


Arg 
45 




Val 


He 


Ser 


His 


Tyr 
50 


Ala 


Gly 


Gin 


Asp 


Ala 
55 


Thr 


Asp 


Pro 


Phe 


Val 
60 




Ala 


Phe 


His 


He 


Asn 


Lys 


Gly 


Leu 


Val 


Lys 


Lys 


Tyr 


Met 


Asn 


Ser 


45 
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(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2511785) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
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(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2535) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
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(2) INFORMATION FOR SEQ ID NO: 44: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 253538a) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
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Pro 


Leu 


Leu 


Arg 


Ala 


Leu 












415 






Ser 


Gly 


Lys 


Leu 


Trp 


Leu 


30 










430 




Ser 


Pro 


Arg 


Asp 


Thr 


Val 












445 






Gin 


Arg 


Asn 


Asp 


Gly 


Leu 












460 




35 


va J. 


Tyr 


Ala 


Leu 


Leu 


Thr 








475 






lieu 


Leu 


Ser 


Phe 


Phe 


Ser 












490 






Ser 


Trp 


Asp 


Leu 


Pro 


Ser 


40 










505 




Leu 


Pro 


Val 


Pro 


Pro 


Ser 












520 






Pro 


Pro 


Gly 


Val 


Ala 


Leu 












535 




45 


Met 


Gly 


Gly 


Asp 


Gin 


Arg 










550 




Leu 


Ala 


Ala 


Trp 


Ser 


Leu 












565 






Met 


Leu 


Leu 


Gly 


Phe 


lie 


50 










580 




Leu 


Thr 


Leu 


Pro 


Ala 


Trp 












595 






Val 


nxs 


Fro 


Pne 


lie 


Glu 
















55 


Leu 


Pro 


Pro 


Ala 


Ala 


Arg 










625 




Val 


Gin 


Gly 


Pro 


Arg 


Pro 












640 






Pro 


Trp 


Lys 


Ser 


Pro 


Pro 


60 










655 




Gly 


Phe 


His 


Gly 


Pro 


His 












67 0 






Asp 


Leu 


Gly 


Thr 


Lys 


Gly 
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220 










225 


Trp 


Gin 


Pro 


He 
235 


Glu 


Tyr 


Gly 


Lys 


Lys 
240 


Tyr 


Asn 


His 


Gin 
250 


His 


Glu 


Tyr 


Phe 


Phe 
255 


Leu 


lie 


Pro 


Met 
265 


Tyr 


Phe 


Gin 


Tyr 


Gin 
270 


Val 


His 


Lys 


Asn 
280 


Trp 


Val 


Asp 


Leu 


Ala 
285 


lie 


Arg 


Phe 


Phe 
295 


He 


Thr 


Tyr 


He 


Pro 
300 


Ala 


Leu 


Leu 


Phe 
310 


Leu 


Asn 


Phe 


He 


Arg 
315 


Phe 


Val 


Trp 


Val 
325 


Thr 


Gin 


Met 


Asn 


His 
330 


Gin 


Glu 


Ala 


Tyr 
340 


Arg 


Asp 


Trp 


Phe 


Ser 
345 


Cys 


Asn 


Val 


Glu 
355 


Gin 


Ser 


Phe 


Phe 


Asn 
360 


Leu 


Asn 


Phe 


Gin 
370 


He 


Glu 


His 


His 


Leu 
375 


His 


Asn 


Leu 


His 
385 


Lys 


He 


Ala 


Pro 


Leu 
390 


Lys 


His 


Gly 


He 
405 


Glu 


Tyr 


Gin 


Glu 


Lys 
410 


Leu 


Asp 


lie 


He 
420 


Arg 


Ser 


Leu 


Lys 


Lys 
425 


Asp 


Ala 


Tyr 


Leu 
435 


His 


Lys 


* * * 


Ser 


His 
440 


Gly 


Lys 


Gly 


Cys 
450 


Arg 


Trp 


Gly 


Asp 


Gly 
455 


Leu 


Phe 




Gly 
465 


Val 


Ser 


Glu 


Arg 


Leu 
470 


Asp 


Pro 


Met 


Leu 
480 


Asp 


Leu 


Ser 


Pro 


Phe 
485 


Ser 


His 


Leu 


Pro 
495 


His 


Ser 


Thr 


Leu 


Pro 
500 


Leu 


Ser 


Arg 


Gin 
510 


Pro 


Ser 


Ala 


Met 


Ala 
515 


Pro 


Phe 


Phe 


Gin 
525 


Gly 


Ala 


Glu 


Arg 


Trp 
530 


Ser 


Tyr 


Leu 


His 
540 


Ser 


Leu 


Pro 


Leu 


Lys 
545 


Ser 


Met 


Gly 


Leu 
555 


Ala 


Cys 


Glu 


Ser 


Pro 
560 


Gly 


lie 


Thr 


Pro 
570 


Ala 


Leu 


Val 


Leu 


Gin 
575 


Gly 


Ala 


Gly 


Pro 
585 


Ser 


Arg 


Ala 


Gly 


Pro 
590 


Leu 


His 


Ser 


Pro 
600 


* * 


Arg 


Leu 


Pro 


Leu 
605 


Arg 


Pro 


Ala 


Leu 
615 


Leu 


Gin 


Ser 


Ser 


Gly 
620 


Leu 


Ser 


Thr 


Arg 
630 


Gly 


Leu 


Ser 


* ★ * 


Asp 
635 


Ala 


Gly 


Thr 


Ala 
645 


Ser 


Pro 


Asn 


Leu 


Gly 
650 


Pro 


His 


His 


** ★ 
660 


Ser 


Ala 


Leu 


Thr 


Leu 
665 


Ser 


Thr 


Ala 


Ser 
675 


Pro 


Thr 


* + * 


Ala 


Cys 
680 


Gly 


Val 


Pro 


Arg 


Leu 


Leu 


* * + 


Leu 


Ser 
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685 690 695 

Arg Gly Ser Gly His Val Gin Gly Gly Ala Gly Trp Pro Gly Gly 

700 705 710 

Ser Ala His Pro Pro Ala Phe Pro Gin Gly Val Leu Arg Ser Lys 

5 715 720 725 

He Leu Glu Gin Ser Asp Pro Ser Pro Lys Ala Leu Leu Ser Ala 

730 735 740 

Gly Gin Cys Gin Pro He Pro Gly His Leu Ala Pro Gly Asp Val 

745 750 755 

10 Gly Pro Xxx 



(2) IN FORMAT I ON FOR SEQ ID NO: 45: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 6 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



20 



45 



60 



65 



(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



(ii) MOLECULE TYPE: peptide 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



55 



Tyr 


Val 


Thr 


Pro 


Phe 


Gin 


Thr 


Arg 


Ser 


Trp 


Tyr 


His 


Lys 


Tyr 


Gin 


1 








5 










10 










15 


His 


He 


Tyr 


Ala 


Pro 


Leu 


Leu 


Tyr 


Gly 


He 


Tyr 


Thr 


Leu 


Lys 


Tyr 










20 










25 








30 


Arg 


Thr 


Gin 


Asp 


Trp 
35 


Glu 


Ala 


Phe 


Val 


Lys 
40 


Asp 


Gly 


Lys 


Asn 


Gly 
45 


Ala 


He 


Arg 


Val 


Ser 
50 


Val 


Ala 


Thr 


Asn 


Phe 
55 


Asp 


Lys 


Ala 


Ala 


Tyr 
60 


Val 


He 


Gly 


Lys 


Leu 
65 


Ser 


Phe 


Val 


Phe 


Phe 
70 


Arg 


Phe 


He 


L u 


Pro 
75 


Leu 


Arg 


Tyr 


His 


Ser 
80 


Phe 


Thr 


Asp 


Leu 


He 
85 


Cys 


Tyr 


Phe 


Leu 


He 
90 


Ala 


Glu 


Phe 


Val 


Phe 
95 


Gly 


Trp 


Tyr 


Leu 


Thr 
100 


He 


Asn 


Phe 


Gin 


Val 
105 



60 



25 CGTATGTCAC TCCATTCCAA ACTCGTTCAT GGTAT CATAA ATATCAACAC ATTTACGCTC 

CACTCCTCTA TGGTATTTAC ACACTCAAAT ATCGTACTCA AGATTGGGAA GCTTTTGTAA 120 

AGGATGGTAA AAATGGTGCA ATTCGTGTTA GTGTCGCCAC AAATTTCGAT AAGGCCGCTT 180 

ACGTCATTGG TAAATTGTCT TTTGTTTTCT TCCGTTTCAT CCTTCCACTC CGTTATCATA 240 

GCTTTACAGA TTTAATTTGT TATTTCCTCA TTGCTGAATT CGTCTTTGGT TGGTATCTCA 300 

30 CAATTAATTT CCAAGTTAGT CATGTCGCTG AAGAT CTCAA ATTCTTTGCT ACCCCTGAAA 360 

GACCAGATGA ACCATCTCAA ATCAATGAAG ATTGGGCAAT CCTTCAACTT AAAACTACTC 420 

AAGATTATGG TCATGGTTCA CTCCTTTGTA CCTTTTTTAG TGGTTCTTTA AATCATCAAG 480 

TTGTTCATCA TTTATTCCCA TCAATTGCTC AAGATTTCTA CCCACAACTT GTACCAATTG 54 0 

TAAAAGAAGT TTGTAAAGAA CATAACATTA CTTACCACAT TAAACCAAAC TTCACTGAAG 600 

35 CTATTATGTC ACACATTAAT TACCTTTACA AAATGGGTAA TGATCCAGAT TATGTTAAAA 660 

AACCATTAGC CTCAAAAGAT GATTAAATGA AATAACTTAA AAACCAATTA TTTACTTTTG 720 

ACAAACAGTA ATATTAATAA ATACAA 74 6 

40 (2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 
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Ser 


His 


Val 


Ala 


Glu 
110 


Asp 


Leu 


Lys 


Phe 


Phe 
115 


Ala 


Thr 


Pro 


Glu 


Arg 
120 


Pro 


Asp 


Glu 


Pro 


Ser 


Gin 


He 


Asn 


Glu 


Asp 


Trp 


Ala 


He 


Leu 


Gin 










125 










130 








135 


Leu 


Lys 


Thr 


Thr 


Gin 


Asp 


Tyr 


Gly 


His 


Gly 


Ser 


Leu 


Leu 


Cys 


Thr 










140 










145 








150 


Phe 


Phe 


Ser 


Gly 


Ser 
155 


Leu 


Asn 


His 


Gin 


Val 
160 


Val 


His 


His 


Leu 


Phe 
165 


Pro 


Ser 


lie 


Ala 


Gin 
170 


Asp 


Phe 


Tyr 


Pro 


Gin 
175 


Leu 


Val 


Pro 


He 


Val 
180 


Lys 


Glu 


Val 


Cys 


Lys 


Glu 


His 


Asn 


He 


Thr 


Xyr 


His 


He 


Lys 


Pro 










185 










190 








195 


Asn 


Phe 


Thr 


Glu 


Ala 


lie 


Met 


Ser 


His 


He 


Asn 


Tyr 


Leu 


Tyr 


Lys 










200 










205 






210 


Met 


Gly 


Asn 


Asp 


Pro 
215 


Asp 


Tyr 


Val 


Lys 


Lys 
220 


Pro 


Leu 


Ala 


Ser 


Lys 
225 


Asp 


Asp 


* * + 



























10 



15 



20 (2) INFORMATION FOR SEQ ID NO 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 nucleic acids 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

30 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 47: 

TTTTGGAAGG NTCCAAGTTN ACCACGGANT NGGCAAGTTN ACGGGGCGGA AANCGGTTTT 60 

CCCCCCAAGC CTTTTGTCGA CTGGTTCTGT GGTGGCTTCC AGTACCAAGT CGACCACCAC 120 

TTATTCCCCA GCCTGCCCCG ACACAATCTG GCCAAGACAC ACGCACTGGT CGAATCGTTC 180 

TGCAAGGAGT GGGGTGTCCA GTACCACGAA GCCGACCTCG TGGACGGGAC CATGGAAGTC 24 0 

TTGCACCATT TGGGCAGCGT GGCCGGCGAA TTCGTCGTGG ATTTTGTACG CGACGGACCC 300 

GCCATGTAAT CGTCGTTCGT GACGATGCAA GGGTTCACGC ACATCTACAC ACACTCACTC 360 

ACACAACTAG TGTAACTCGT ATAGAATTCG GTGTCGACCT GGACCTTGTT TGACTGGTTG 420 

4U GGGATAGGGT AGGTAGGCGG ACGCGTGGGT CGNCCCCGGG AATTCTGTGA CCGGTACCTG 4 80 

GCCCGCGTNA AAGT 494 



45 (2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

50 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



60 



65 



Ser 


Xxx 


Pro 


Arg 


Xxx 
10 


Xxx 


Gin 


Val 


Xxx 


Gly 
15 


Pro 


Pro 


Lys 


Pro 


Phe 
25 


Val 


Asp 


Trp 


Phe 


Cys 
30 


Gin 


Val 


Asp 


His 


His 
40 


Leu 


Phe 


Pro 


Ser 


Leu 
45 


Ala 


Lys 


Thr 


His 


Ala 
55 


Leu 


Val 


Glu 


Ser 


Phe 
60 


Val 


Gin 


Tyr 


His 


Glu 
70 


Ala 


Asp 


Leu 


Val 


Asp 
75 



5 

Phe 

20 
ryr 

35 
Leu 

50 

siy 

65 
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Gly Thr Met Glu Val Leu His His Leu Gly Ser Val Ala Gly Glu 
65 70 75 

Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met 
80 85 



10 



20 



40 



55 



60 



65 



(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 nucleic acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



GGATGGAGTT CGTCTGGATC GCTGTGCGCT ACGCGACGTG GTTTAAGCGT CATGGGTGCG 60 

25 CTTGGGTACA CGCCGGGGCA GTCGTTGGGC ATGTACTTGT GCGCCTTTGG TCTCGGCTGC 120 

ATTTACATTT TTCTGCAGTT CGCCGTAAGT CACACCCATT TGCCCGTGAG CAACCCGGAG 180 

GATCAGCTGC ATTGGCTCGA GTACGCGCGG ACCACACTGT GAACAT CAGC ACCAAGTCGT 2 40 

GGTTTGTCAC ATGGTGGATG TCGAACCTCA ACTTTCAGAT CGAGCACCAC CTTTTCCCCA -300 

CGGCGCCCCA GTTCCGTTTC AAGGAGATCA GCCCGCGCGT CGAGGCCCTC TTCAAGCGCC 3 60 

30 ACGGTCTCCC TTACTACGAC ATGCCCTACA CGAGCGCCGT CTCCACCACC TTTGCCAACC 420 

TCTACTCCGT CGGCCATTCC GTCGGCGACG CCAAGCGCGA CTAGCCTCTT TTCCTAGACC 4 80 

TTAATTCCCC ACCCCACCCC ATGTTCTGTC TTCCTCCCGC 520 

35 (2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



50 



Met 


Glu 


Phe 


Val 


Trp 


He 


Ala 


Val 


Arg 


Tyr 


Ala 


Thr 


Trp 


Phe 


Lys 


1 








5 










10 










15 


Arg 


His 


Gly 


Cys 


Ala 


Trp 


Val 


His 


Ala 


Gly 


Ala 


Val 


Val 


Gly 


His 










20 










25 








30 


Val 


Leu 


Val 


Arg 


Leu 


Trp 


Ser 


Arg 


Leu 


His 


Leu 


His 


Phe 


Ser 


Ala 


Val 








35 










40 










45 


Arg 


Arg 


Lys 


Ser 


His 


Pro 


Phe 


Ala 


Arg 


Glu 


Gin 


Pro 


Gly Gly 


Ser 


Ala 






50 










55 










60 


Ala 


Leu 


Ala 


Arg 


Val 


Arg 


Ala 


Asp 


His 


Thr 


Val 


Asn 


He 


Ser 


Thr 






65 










70 










75 


Lys 


Ser 


Trp 
80 


Phe 


Val 


Thr 


Trp 


Trp 
85 


Met 


Ser 


Asn 


Leu 


Asn 
90 


Phe 


Gin 


He 


Glu 


His 


His 


Leu 


Phe 


Pro 


Thr 


Ala 


Pro 


Gin 


Phe 


Arg 


Phe 








95 










100 










105 


Lys 


Glu 


He 


Ser 


Pro 


Arg 


Val 


Glu 


Ala 


Leu 


Phe 


Lys 


Arg 


His 


Gly 








110 










115 






120 


Leu 


Pro 


Tyr 


Tyr 
125 


Asp 


Met 


Pro 


Tyr 


Thr 
130 


Ser 


Ala 


Val 


Ser 


Thr 
135 


Thr 


Phe 


Ala 


Asn 


Leu 


Tyr 


Ser 


Val 


Gly 


His 


Ser 


Val 


Gly Asp Ala 
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140 145 150 

Lys Arg Asp 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 nucleic acids 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



15 



30 



40 



45 



50 



55 



60 



<ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



ACGCGTCCGC CCACGCGTCC GCCGCGAGCA ACTCATCAAG GAAGGCTACT TTGACCCCTC 60 

20 GCTCCCGCAC ATGACGTACC GCGTGGTCGA GATTGTTGTT CTCTTCGTGC TTTCCTTTTG 120 

GCTGATGGGT CAGTCTTCAC CCCTCGCGCT CGCTCTCGGC ATTGTCGTCA GCGGCATCTC 180 

TCAGGGTCGC TGCGGCTGGG TAATGCATGA GATGGGCCAT GGGTCGTTCA CTGGTGTCAT 240 

TTGGCTTGAC GACCGGTTGT GCGAGTTCTT TTACGGCGTT GGTTGTGGCA TGAGCGGTCA 300 

TTACTGGAAA AACCAGCACA GCAAACACCA CGCAGCGCCA AACCGGCTCG AGCACGATGT 360 

25 AGATCTCAAC ACCTTGCCAT TGGTGGCCTT CAACGAGCGC GTCGTGCGCA AGGTCCGACC 420 



(2) INFORMATION FOR SEQ ID NO: 52: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



Arg 


Val 


Arg 


Pro 


Arg 


Val 


Arg 


Arg 


Glu 


Gin 


Leu 


He 


Lys 


Glu 


Gly 


1 








5 










10 










15 


Tyr 


Phe 


Asp 


Pro 


Ser 


Leu 


Pro 


His 


Met 


Thr 


Tyr 


Arg 


Val 


Val 


Glu 










20 










25 








30 


lie 


Val 


Val 


Leu 


Phe 


Val 


Leu 


Ser 


Phe 


Trp 


Leu 


Met 


Gly Gin 


Ser 










35 










40 










45 


Ser 


Pro 


Leu 


Ala 


Leu 


Ala 


Leu 


Gly 


He 


Val 


Val 


Ser 


Gly 


He 


Ser 


Gin 








50 










55 








60 


Gly 


Arg 


Cys 


Gly 


Trp 


Val 


Met 


His 


Glu 


Met 


Gly 


His 


Gly 


Ser 


Phe 








65 










70 






75 


Thr 


Gly 


Val 


He 


Trp 


Leu 


Asp 


Asp 


Arg 


Leu 


Cys 


Glu 


Phe 


Phe 










65 










70 








75 


Tyr 


Gly 


Val 


Gly 


Cys 


Gly 


Met 


Ser 


Gly 


His 


Tyr 


Trp 


Lys 


Asn 


Gin 


His 








80 










85 






90 


Ser 


Lys 


His 


His 


Ala 


Ala 


Pro 


Asn 


Arg 


Leu 


Glu 


His 


Asp 


Val 










95 










100 








105 


Asp 


Leu 


Asn 


Thr 


Leu 


Pro 


Leu 


Val 


Ala 


Phe 


Asn 


Glu 


Arg 


Val 


Val 










110 










115 








120 


Arg 


Lys 


Val 


Arg 


Pro 
125 
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What is claimed is : 

1 . A nucleic acid construct comprising: 

One or more nucleotide sequences depicted in a SEQ ID NO: selected from 
the group consisting of SEQ ID NO:l, SEQ ID NO:3 and SEQ ID NO:5, 
5 wherein said one or more nucleotide sequences is linked to a heterologous 

nucleotide sequence. 

2. A nucleic acid construct comprising: 

One or more nucleotide sequences depicted in a SEQ ID NO: selected from 
1 0 the group consisting of SEQ ID NO: 1 , SEQ ID NO:3 and SEQ ID NO:5, 

wherein said one or more nucleotide sequences is operably associated with an 
expression control sequence functional in a plant cell 

3. The nucleic acid construct according to claim 2, wherein said nucleotide 
1 5 sequence has an average A + T content of less than about 60%. 

4. The nucleic acid construct according to claim 2, wherein said nucleotide 
sequence is derived from a fungus. 



20 5. The nucleic acid construct according to claim 4, wherein said fungus is of 

the genus Mortierella. 



6. The nucleic acid construct according to claim 5, wherein said fungus is of 
the species alpina. 



25 



7. A nucleic acid construct comprising: 

A nucleotide sequence which encodes a polypeptide comprising an amino 
acid sequence depicted in SEQ ID NO:2, wherein said nucleotide sequence is 
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operably associated with a transcription or an expression control sequence 
function in a plant cell, wherein said nucleotide sequence encodes a functionally 
active polypeptide which desaturates a fatty acid molecule at carbon 6 from the 
carboxyl end of said fatty acid molecule. 

8. A nucleic acid construct comprising: 

A nucleotide sequence which encodes a polypeptide comprising an 
amino acid sequence depicted in SEQ ID NO:4, wherein said nucleotide 
sequence is operably associated with a transcription or an expression control 
sequence functional in a plant cell, wherein said nucleotide sequence encodes a 
functionally active polypeptide which desaturates a fatty acid molecule at 
carbon 12 from the carboxyl end of said fatty acid molecule. 

9. A nucleic acid construct comprising: 

A nucleotide sequence which encodes a polypeptide comprising an 
amino acid sequence depicted in SEQ ID NO:6, wherein said nucleotide 
sequence is operably associated with a transcription or an expression control 
sequence function in a plant cell, wherein said nucleotide sequence encodes a 
functionally active polypeptide which desaturates a fatty acid molecule at 
carbon 5 from the carboxyl end of said fatty acid moleculle. 

10. A nucleic acid construct comprising: 

at least one nucleotide sequence which encodes a functionally active 
desaturase having an amino acid sequence depicted in a SEQ ID NO: selected 
from the group consisting of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6, 
wherein said nucleotide sequence is operably associated with a promoter 
functional in a plant cell. 
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11. The nucleic acid construct according to claim 10, wherein said plant cell is a 
seed cell. 



12. The nucleic acid construct according to claim 11, wherein said seed cell is 
an embryo cell. 



13. A recombinant plant cell comprising: 

At least one copy of a DNA sequence which encodes at least one 
functionally active Mortierella alpina fatty acid desaturase which results in the 

10 production of a polyunsaturated fatty acid, wherein said fatty acid desaturase 

has an amino acid sequence as depicted in a SEQ ID NO: selected from the 
group consisting of SEQ ID NO:2, SEQ ID NO:4, and SEQ ID NO:6, wherein 
said cell was transformed with a vector comprising said DNA sequence, and 
wherein said DNA sequence is operably associated with an expression control 

15 sequence. 



14. The recombinant plant cell of claim 13, wherein said polyunsaturated fatty 
acid is selected from the group consisting of LA, AR A, GLA, DGLA, SDA 
and EPA. 



20 



15. The recombinant plant cell of claim 13, wherein said recombinant plant cell 
is enriched in a fatty acid selected from the group consisting of 18: 1, 18:2, 
18:3 and 18:4. 



25 16. The recombinant plant cell of claim 15, wherein said plant cell is selected 

from the group consisting of Brassica, soybean, saf flower, corn, flax, and 
sunflower. 
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17. The recombinant plant cell according to claim 16, wherein said expression 
control sequence is endogenous to said plant cell. 



18. One or more plant oils expressed by said recombinant plant cell of claim 16. 

19. A method for obtaining altered long chain polyunsaturated fatty acid 
biosynthesis comprising the steps of: 

growing a plant having cells which contain a transgene encoding a 
transgene expression product which desaturates a fatty acid molecule at carbon 
5 from the carboxyl end of said fatty acid molecule, wherein said transgene is 
operably associated with an expression control sequence, under conditions 
whereby said transgene is expressed, whereby long chain polyunsaturated fatty 
acid biosynthesis in said cells is altered. 



20. A method for obtaining altered long chain polyunsaturated fatty acid 
biosynthesis comprising the steps of: 

growing a plant having cells which contain one or more transgenes, 
derived from a fungus or algae, which encodes a transgene expression product 
which desaturates a fatty acid molecule at a carbon selected from the group 
consisting of carbon 5, carbon 6 and carbon 12 from the carboxyl end of said 
fatty acid molecule, wherein said one or more transgenes is operably associated 
with an expression control sequence, under conditions whereby said one or 
more transgenes is expressed, whereby long chain polyunsaturated fatty acid 
biosynthesis in said cells is altered. 

21. The method according to claims 19 or 20, wherein said long chain 
polyunsaturated fatty acid is selected from the group consisting of LA, ARA, 
GLA, DGLA, SDA and EPA. 
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22. A plant oil or fraction thereof produced according to the method of claims 
19 or 20. 



23. A method of treating or preventing malnutrition comprising administering 
said plant oil of claim 22 to a patient in need of said treatment or prevention 
in an amount sufficient to effect said treatment or prevention. 

24. A pharmaceutical composition comprising said plant oil or fraction of claim 
22 and a pharmaceutically acceptable carrier. 



25. The pharmaceutical composition of claim 24, wherein said pharmaceutical 
composition is in the form of a solid or a liquid. 



26. The pharmaceutical composition of claim 25, wherein said pharmaceutical 
15- composition is in a capsule or tablet form. 



27. The pharmaceutical composition of claim 24 further comprising at least one 
nutrient selected from the group consisting of a vitamin, a mineral, a 
carbohydrate, a sugar, an amino acid, a free fatty acid, a phospholipid, an 
20 antioxidant, and a phenolic compound. 



28. A nutritional formula comprising said plant oil or fraction thereof of claim 
22. 



25 29. The nutritional formula of claim 28, wherein said nutritional formula is 

selected from the group consisting of an infant formula, a dietary 
supplement, and a dietary substitute. 
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30. The nutritional formula of claim 29, wherein said infant formula, dietary 
supplement or dietary supplement is in the form of a liquid or a solid. 

31. An infant formula comprising said plant oil or fraction thereof of claim 22. 

5 

32. The infant formula of claim 31 further comprising at least one macronutrient 
selected from the group consisting of coconut oil, soy oil, canola oil, mono- 
and diglycerides, glucose, edible lactose, electrodialysed whey, 
electrodialysed skim milk, milk whey, soy protein, and other protein 

10 hydrolysates. 

33. The infant formula of claim 32 further comprising at least one vitamin 
selected from the group consisting of Vitamins A, C, D, E, and B complex; 
and at least one mineral selected from the group consisting of calcium, 

*5 magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 

chloride, iodine, selenium, and iron. 

34. A dietary supplement comprising said plant oil or fraction thereof of claim 
22. 

20 

35. The dietary supplement of claim 34 further comprising at least one 
macronutrient selected from the group consisting of coconut oil, soy oil, 
canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
whey, electrodialysed skim milk, milk whey, soy protein, and other protein 

25 hydrolysates. 

36. The dietary supplement of claim 35 further comprising at least one vitamin 
selected from the group consisting of Vitamins A, C, D, E, and B complex; 
and at least one mineral selected from the group consisting of calcium, 
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magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

37. The dietary supplement of claim 34 or claim 36, wherein said dietary 
5 supplement is administered to a human or an animal. 

38. A dietary substitute comprising said plant oil or fraction thereof of claim 22. 

39. The dietary substitute of claim 38 further comprising at least one 

10 macronutrient selected from the group consisting of coconut oil, soy oil, 

canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
whey, electrodialysed skim milk, milk whey, soy protein, and other protein 
hydrolysates. 

15 40. The dietary substitute of claim 39 further comprising at least one vitamin 

selected from the group consisting of Vitamins A, C, D, E, and B complex; 
and at least one mineral selected from the group consisting of calcium, 
magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

20 

41. The dietary substitute of claim 38 or claim 40, wherein said dietary 
substitute is administered to a human or animal. 

42. A method of treating a patient having a condition caused by insuffient 

25 intake or production of polyunsaturated fatty acids comprising administering 

to said patient said dietary substitute of claim 38 or said dietary supplement 
of claim 34 in an amount sufficient to effect said treatment. 
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43. The method of claim 42, wherein said dietary substitute or said dietary 
supplement is administered enterally or parenterally. 

44. A cosmetic comprising said plant oil or fraction thereof of claim 22. 

5 

45. The cosmetic of claim 44, wherein said cosmetic is applied topically. 

46. The pharmaceutical composition of claim 24, wherein said pharmaceutical 
composition is administered to a human or an animal. 

10 

47. An animal feed comprising said plant oil or fraction thereof of claim 22. 

48. An isolated nucleotide sequence comprising the nucleotide sequence 
selected from the group consisting of SEQ ID NO:38 - SEQ ID NO:44 

15 wherein said nucleotide sequence is expressed in a plant cell. 

49. The method of claim 20 wherein said fungus is Mortierella species. 

50. The method of claim 49 wherein said fungus is Mortierella alpina. 

20 

5 1. An isolated nucleotide sequence selected from the group consisting of SEQ 
ID NO:49 - SEQ ID NO: 50 wherein said sequence is expressed in a plant 
cell. 
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FastA Match of ma29 and contig 253538a 

SCORES Initl: 117 Initn: 225 Opt: 256 

Smith-Waterman score: 408; 27.0% identity in 441 aa overlap 
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FastA Match of ma524 and contig 253538a 



SCORES Initl: 231 Initn: 499 Opt: 401 

Smith-Waterman score: 620; 27.3% identity in 455 aa overlap 
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METHODS AND COMPOSITIONS FOR SYNTHESIS OF 
LONG CHAIN POLYUNSATURATED FATTY ACIDS IN PLANTS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of USSN 08/834,655, filed 
5 April 1 1 , 1997, and a continuation in part of USSN 08/833,610, filed April 11, 
1997, USSN 08/834,033 filed April 11, 1997 and USSN 08/956,985 filed 
October 24, 1997 which disclosures are incorporated herein by reference. 

INTRODUCTION 

Field of the Invention 

10 This invention relates to modulating levels of enzymes and/or enzyme 

components capable of altering the production of long chain polyunsaturated 
fatty acids (PUFAS) in a host plant. The invention is exemplified by the 
production of PUFAS in plants. 

Background 

15 Two main families of polyunsaturated fatty acids (PUFAs) are the co3 

fatty acids, exemplified by arachidonic acid, and the 006 fatty acids, exemplified 
by eicosapentaenoic acid. PUFAs are important components of the plasma 
membrane of the cell, where they may be found in such forms as phospholipids. 
PUFAs also serve as precursors to other molecules of importance in human 

20 beings and animals, including the prostacyclins, leukotrienes and 

prostaglandins. PUFAs are necessary for proper development, particularly in 
the developing infant brain, and for tissue formation and repair. 

Four major long chain PUFAs of importance include docosahexaenoic 

acid (DHA) and eicosapentaenoic acid (EPA), which are primarily found in 

25 different types of fish oil, gamma-linolenic acid (GLA), which is found in the 

seeds of a number of plants, including evening primrose (Oenothera biennis)^ 

borage (Borago officinalis) and black currants (Ribes nigrum), and stearidonic 

acid (SDA), which is found in marine oils and plant seeds. Both GLA and 

another important long chain PUFA, arachidonic acid (ARA), are found in 

-1- 
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filamentous fungi. ARA can be purified from animal tissues including liver and 
adrenal gland. 

For DHA, a number of sources exist for commercial production 
including a variety of marine organisms, oils obtained from cold water marine 
5 fish, and egg yolk fractions. For ARA, microorganisms including the genera 
Mortierella, Entomophthora, Phytium and Porphyridium can be used for 
commercial production. Commercial sources of SDA include the genera 
Trichodesma and Echium. Commercial sources of GLA include evening 
primrose, black currants and borage. However, there are several disadvantages 

1 0 associated with commercial production of PUFAs from natural sources. Natural 
sources of PUFAs, such as animals and plants, tend to have highly 
heterogeneous oil compositions. The oils obtained from these sources therefore 
can require extensive purification to separate out one or more desired PUFAs or 
to produce an oil which is enriched in one or more PUFA. Natural sources also 

1 5 are subject to uncontrollable fluctuations in availability. Fish stocks may 
undergo natural variation or may be depleted by overfishing. Fish oils have 
unpleasant tastes and odors, which may be impossible to economically separate 
from the desired product, and can render such products unacceptable as food 
supplements. Animal oils, and particularly fish oils, can accumulate 

20 environmental pollutants. Weather and disease can cause fluctuation in yields 
from both fish and plant sources. Cropland available for production of alternate 
oil-producing crops is subject to competition from the steady expansion of 
human populations and the associated increased need for food production on the 
remaining arable land. Crops which do produce PUFAs, such as borage, have 

25 not been adapted to commercial growth and may not perform well in 

monoculture. Growth of such crops is thus not economically competitive where 
more profitable and better established crops can be grown. Large scale 
fermentation of organisms such as Mortierella is also expensive. Natural 
animal tissues contain low amounts of ARA and are difficult to process. 

30 Microorganisms such as Porphyridium and Mortierella are difficult to cultivate 
on a commercial scale. 

-2- 
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Dietary supplements and pharmaceutical formulations containing 
PUFAs can retain the disadvantages of the PUFA source. Supplements such as 
fish oil capsules can contain low levels of the particular desired component and 
thus require large dosages. High dosages result in ingestion of high levels of 
5 undesired components, including contaminants. Care must be taken in 

providing fatty acid supplements, as overaddition may result in suppression of 
endogenous biosynthetic pathways and lead to competition with other necessary 
fatty acids in various lipid fractions in vivo, leading to undesirable results. For 
example, Eskimos having a diet high in ©3 fatty acids have an increased 
10 tendency to bleed (U.S. Pat. No. 4,874,603). Unpleasant tastes and odors of the 
supplements can make such regimens undesirable, and may inhibit compliance 
by the patient. 

A number of enzymes are involved in PUFA biosynthesis. Linoleic acid 
(LA, 18:2 A9, 12) is produced from oleic acid (18:1 A9) by a A12-desaturase. 

15 GLA (18:3 A6, 9, 12) is produced from linoleic acid (LA, 18:2 A9, 12) by a A6- 
desaturase. ARA (20:4 A5, 8, 11, 14) production from DGLA (20:3 A8, 11, 14) 
is catalyzed by a AS-desaturase. However, animals cannot desaturate beyond 
the A9 position and therefore cannot convert oleic acid (18:1 A9) into linoleic 
acid (18:2 A9, 12). Likewise, a-linolenic acid (ALA, 18:3 A9, 12, 15) cannot 

20 be synthesized by mammals. Other eukaryotes, including fungi and plants, have 
enzymes which desaturate at positions A21 and A15. The major poly- 
unsaturated fatty acids of animals therefore are either derived from diet and/or 
from desaturation and elongation of linoleic acid (18:2 A9, 12) or oc-linolenic 
acid (18:3 A9, 12, 15). 

25 Poly-unsaturated fatty acids are considered to be useful for nutritional, 

pharmaceutical, industrial, and other purposes. An expansive supply of poly- 
unsaturated fatty acids from natural sources and from chemical synthesis are not 
sufficient for commercial needs. Therefore it is of interest to obtain genetic 
material involved in PUFA biosynthesis from species that naturally produce 

30 these fatty acids and to express the isolated material alone or in combination in 
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a heterologous system which can be manipulated to allow production of 
commercial quantities of PUFAS. 

The present invention is further directed to formulas, dietary 
supplements or dietary supplements in the form of a liquid or a solid containing 
5 the long chain fatty acids of the invention. These formulas and supplements 
may be administered to a human or an animal. 

The formulas and supplements of the invention may further comprise at 
least one macronutrient selected from the group consisting of coconut oil, soy 
oil, canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
1 0 whey, electrodialysed skim milk, milk whey, soy protein, and other protein 
hydrolysates. 

The formulas of the present invention may further include at least one 
vitamin selected from the group consisting of Vitamins A, C, D, E, and B 
complex; and at least one mineral selected from the group consisting of 
1 5 calcium, magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

The present invention is further directed to a method of treating a patient 
having a condition caused by insuffient intake or production of polyunsaturated 
fatty acids comprising administering to the patient a dietary substitute of the 
20 invention in an amount sufficient to effect treatment of the patient. 

The present invention is further directed to cosmetic and pharmaceutical 
compositions of the material of the invention. 

The present invention is further directed to transgenic oils in 
pharmaceutically acceptable carriers. The present invention is further directed 
to nutritional supplements, cosmetic agents and infant formulae containing 
transgenic oils. 

The present invention is further directed to a method for obtaining 
altered long chain polyunsaturated fatty acid biosynthesis comprising the steps 
of: growing a microbe having cells which contain a transgene which encodes a 
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transgene expression product which desaturates a fatty acid molecule at carbon 
5,5 or 12 from the carboxyl end of said fatty acid molecule, wherein the 
trangene is operably associated with an expression control sequence, under 
conditions whereby the transgene is expressed, whereby long chain 
5 polyunsaturated fatty acid biosynthesis in the cells is altered. 

The present invention is further directed toward pharmaceutical 
compositions comprising at least one nutrient selected from the group consisting 
of a vitamin, a mineral, a carbohydrate, a sugar, an amino acid, a free fatty acid, 
a phospholipid, an antioxidant, and a phenolic compound. 

10 Relevant Literature 

Production of gamma-linolenic acid by a A6-desaturase is described in 
USPN 5,552,306 and USPN 5,614,393. Production of 8, 1 1-eicosadienoic acid 
using Mortierella alpina is disclosed in USPN 5,376,541. Production of 
docosahexaenoic acid by dinoflagellates is described in USPN 5,407,957. 

1 5 Cloning of a A6-desaturase from borage is described in PCT publication WO 
96/21022. Cloning of A9-desaturases is described in the published patent 
applications PCT WO 91/13972, EP 0 550 162 Al, EP 0 561 569 A2, EP 0 644 
263 A2, and EP 0 736 598 Al, and in USPN 5,057,419. Cloning of A12- 
desaturases from various organisms is described in PCT publication WO 

20 94/1 1 5 1 6 and USPN 5,443,974. Cloning of Al 5-desaturases from various 

organisms is described in PCT publication WO 93/1 1245. A A6 palmitoyl-acyl 
carrier protein desaturase from Thumbergia alata and its expression in E. coli is 
described in USPN 5,614,400. Expression of a soybean stearyl-ACP desaturase 
in transgenic soybean embryos using a 35S promoter is disclosed in USPN 

25 5,443,974. 

SUMMARY OF THE INVENTION 

Novel compositions and methods are provided for preparation of poly- 
unsaturated long chain fatty acids and desaturases in plants and plant cells. The 
methods involve growing a host plant cell of interest transformed with an 
30 expression cassette functional in a host plant cell, the expression cassette 
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comprising a transcriptional and translational initiation regulatory region, joined 
in reading frame 5 f to a DNA sequence encoding a desaturase polypeptide 
capable of modulating the production of PUFAs. Expression of the desaturase 
polypeptide provides for an alteration in the PUFA profile of host plant cells as 
5 a result of altered concentrations of enzymes involved in PUFA biosynthesis. 
Of particular interest is the selective control of PUFA production in plant tissues 
and/or plant parts such as leaves, roots, fruits and seeds. The invention finds 
use for example in the large scale production of DHA, EPA, ARA, and GLA 
and for modification of the fatty acid profile of edible plant tissues and/or plant 
10 parts. 

The present invention further includes a purified nucleotide sequence or 
polypeptide sequence that is substantially related or homologous to the 
nucleotide and peptide sequences presented in SEQ ID NO:l - SEQ ID NO:52. 
The present invention is further directed to methods of using the sequences 
1 5 presented in SEQ ID NO: 1 to SEQ ID NO:40 as probes to identify related 

sequences, as components of expression systems and as components of systems 
useful for producing transgenic oil. 

BRIEF DESCRI PTION OF THE DRAWINGS 

Figure 1 shows possible pathways for the synthesis of arachidonic acid 
20 (20:4 A5, 8, 1 1, 14) and stearidonic acid (18:4 A6, 9, 12, 15) from palmitic acid 
(Ci 6 ) from a variety of organisms, including algae, Mortierella and humans. 
These PUFAs can serve as precursors to other molecules important for humans 
and other animals, including prostacyclins, leukotrienes, and prostaglandins, 
some of which are shown. 

25 Figure 2 shows possible pathways for production of PUFAs in addition 

to ARA, including EPA and DHA, again compiled from a variety of organisms. 

Figure 3A-E shows the DNA sequence (SEQ ID NO:l) of the 
Mortierella alpina A6 desaturase and the deduced amino acid sequence (SEQ 
ID NO:2). 
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Figure 4 shows an alignment of the Mortierella alpina A6 desaturase 
amino acid sequence with other A6 desaturases and related sequences (SEQ ID 
NOS:7, 8, 9, 10, 11, 12 and 13). 

Figure 5A-D shows the DNA sequence of the Mortierella alpina A12 
5 desaturase (SEQ ID NO:3) and the deduced amino acid sequence (SEQ ID 
NO:4) 

Figure 6 shows the deduced amino acid sequence (SEQ ID NO: 14) of 
the PCR fragment (see Example 1). 

Figure 7A-D shows the DNA sequence of the Mortierella alpina A5 
10 desaturase (SEQ ID NO:5). 

Figure 8 shows alignments of the protein sequence of the A5 desaturase 
(SEQ ID NO:6) with A6 desaturases and related sequences (SEQ ID NOS:15, 
16, 17, 18). 

Figure 9 shows alignments of the protein sequence of the Ma 29 and 
15 contig 253538a. 

Figure 10 shows alignments of the protein sequence of Ma 524 and 
contig 253538a. 

BRIEF DES CRIPTION OF THE SEQUENCE LISTINGS 
SEQ ID NO: 1 shows the DNA sequence of the Mortierella alpina A6 
desaturase. 

SEQ ID NO:2 shows the amino acid sequence of the Mortierella alpina 
A6 desaturase. 

SEQ ID NO:3 shows the DNA sequence of the Mortierella alpina A12 
desaturase. 

SEQ ID NO:4 shows the amino acid sequence of the Mortierella alpina 
A12 desaturase. 
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SEQ ID NO:5 shows the DNA sequence of the Mortierella alpina A5 
desaturase. 

SEQ ID NO:6 shows the amino acid sequence Mortierella alpina A5 
desaturase. 

5 SEQ ID NO:7 - SEQ ID NO: 13 show amino acid sequences that relate 

to Mortierella alpina A6 desaturase. 

SEQ ID NO: 14 shows an amino acid sequence of a PCR fragment of 
Example 1. 

SEQ ID NO: 15 - SEQ ID NO: 18 show amino acid sequences that relate 
1 0 to Mortierella alpina A5 and A6 desaturases. 

SEQ ID NO:19 - SEQ ID NO:30 show PCR primer sequences. 

SEQ ID NO:31 - SEQ ID NO:37 show human nucleotide sequences. 

SEQ ID NO:38 - SEQ ID NO:44 show human peptide sequences. 

SEQ ID NO:45 - SEQ ID NO:46 show the nucleotide and amino acid 
1 5 sequence of a Dictyostelium discoideium desaturase. 

SEQ ID NO:47 - SEQ ID NO:50 show the nucleotide and deduced 
amino acid sequence of a Schizochytrium cDNA clone. 

DESCRIPTI ON OF THE PREFERRED EMBODIMENTS 

In order to ensure a complete understanding of the invention, the 
20 following definitions are provided: 

A5-Desaturase: A5 desaturase is an enzyme which introduces a double 
bond between carbons 5 and 6 from the carboxyl end of a fatty acid molecule. 

A6-Desaturase: A6-desaturase is an enzyme which introduces a double 
bond between carbons 6 and 7 from the carboxyl end of a fatty acid molecule. 

25 A9-Desaturase: A9-desaturase is an enzyme which introduces a double 

bond between carbons 9 and 10 from the carboxyl end of a fatty acid molecule. 

-8- 
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A12-Desaturase: A12-desaturase is an enzyme which introduces a 
double bond between carbons 12 and 13 from the carboxyl end of a fatty acid 
molecule. 

Fatty Acids: Fatty acids are a class of compounds containing a long 
5 hydrocarbon chain and a terminal carboxylate group. Fatty acids include the 
following: 



Fatty Acid 


12:0 


lauric acid 




16:0 


palmitic acid 




16:1 


palmitoleic acid 




18:0 


stearic acid 




18:1 


oleic acid 


A9-18:l 


18:2 A5,9 


taxoleic acid 


A5,9-18:2 


18:2 A6,9 


6,9-octadecadienoic acid 


A6,9-18:2 


1 CO 


linoleic acid 


A9,12-18:2 (LA) 


18:3 A6,9,12 


gamma-linolenic acid 


A6,9, 12-18:3 (GLA) 


18:3 A5,9,12 


pinolenic acid 


A5,9,12-18:3 


1 O.J 


alpna-linolenic acid 


A9,12,15-18:3 (ALA) 


18:4 


stearidonic acid 


A6,9,12,15-18:4 (SDA) 


20:0 


Arachidic acid 




20:1 


Eicoscenic Acid 




22:0 


behehic acid 




22:1 


erucic acid 




22:2 


Docasadienoic acid 




20:4 0)6 


arachidonic acid 


A5,8, 11,14-20:4 (ARA) 


20:3 0)6 


o)6-eicosatrienoic 
dihomo-gamma linolenic 


A8, 11,14-20:3 (DGLA) 


20:5 0)3 


Eicosapentanotc 
(Timnodonic acid) 


A5,8,U, 14,17-20:5 (EPA) 


20:3 0)3 


o)3-eicosatrienoic 


Al 1,16,17-20:3 


20:4 0)3 


o)3 -eicosatetraenoic 


A8,l 1,14,17-20:4 


22:5 0)3 


Docosapentaenoic 


A7,10,13,16,19-22:5 (o)3DPA) 


22:6 0)3 


Docosahexaenoic 
(cervonic acid) 


A4,7, 10, 13, 16, 19-22:6 (DHA) 


24:0 


Lignoceric acid 
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Taking into account these definitions, the present invention is directed to novel 
DNA sequences, DNA constructs, methods and compositions are provided 
which permit modification of the poly-unsaturated long chain fatty acid content 
of plant cells. Plant cells are transformed with an expression cassette 
5 comprising a DNA encoding a polypeptide capable of increasing the amount of 
one or more PUFA in a plant cell. Desirably, integration constructs may be 
prepared which provide for integration of the expression cassette into the 
genome of a host cell. Host cells are manipulated to express a sense or 
antisense DNA encoding a polypeptide(s) that has desaturase activity. By 

1 0 "desaturase" is intended a polypeptide which can desaturate one or more fatty 
acids to produce a mono- or poly-unsaturated fatty acid or precursor thereof of 
interest. By "polypeptide" is meant any chain of amino acids, regardless of 
length or post-translational modification, for example, glycosylation or 
phosphorylation. The substrate(s) for the expressed enzyme may be produced 

15 by the host cell or may be exogenously supplied. 

To achieve expression in a host cell, the transformed DNA is operably 
associated with transcriptional and translational initiation and termination 
regulatory regions that are functional in the host cell. Constructs comprising the 
gene to be expressed can provide for integration into the genome of the host cell 

20 or can autonomously replicate in the host cell. For production of linoleic acid 
(LA), the expression cassettes generally used include a cassette which provides 
for A12 desaturase activity, particularly in a host cell which produces or can 
take up oleic acid. For production of ALA, the expression cassettes generally 
used include a cassette which provides for A15 or co3 desaturase activity, 

25 particularly in a host cell which produces or can take up LA. For production of 
GLA or SDA, the expression cassettes generally used include a cassette which 
provides for A6 desaturase activity, particularly in a host cell which produces or 
can take up LA or ALA, respectively. Production of ©6-type unsaturated fatty 
acids, such as LA or GLA, is favored in a plant capable of producing ALA by 

30 inhibiting the activity of a A15 or a>3 type desaturase; this is accomplished by 
providing an expression cassette for an antisense A15 or co3 transcript, or by 
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disrupting a A15 or co3 desaturase gene. Similarly, production of LA or ALA is 
favored in a plant having A6 desaturase activity by providing an expression 
cassette for an antisense A6 transcript, or by disrupting a A6 desaturase gene. 
Production of oleic acid likewise is favored in a plant having A12 desaturase 
5 activity by providing an expression cassette for an antisense A12 transcript, or 
by disrupting a A12 desaturase gene. For production of ARA, the expression 
cassette generally used provides for A5 desaturase activity, particularly in a host 
cell which produces or can take up DGLA. Production of co6-type unsaturated 
fatty acids, such as ARA, is favored in a plant capable of producing ALA by 
1 0 inhibiting the activity of a Al 5 or co3 type desaturase; this is accomplished by 
providing an expression cassette for an antisense A15 or co3 transcript, or by 
disrupting a A15 or ©3 desaturase gene. 

TRANSGENIC PLANT PRODUCTION OF FATTY ACIDS 

Transgenic plant production of PUFAs offers several advantages over 
purification from natural sources such as fish or plants. Production of fatty 
acids from recombinant plants provides the ability to alter the naturally 
occurring plant fatty acid profile by providing new synthetic pathways in the 
host or by suppressing undesired pathways, thereby increasing levels of desired 
PUFAs, or conjugated forms thereof, and decreasing levels of undesired 
PUFAs. Production of fatty acids in transgenic plants also offers the advantage 
that expression of desaturase genes in particular tissues and/or plant parts means 
that greatly increased levels of desired PUFAs in those tissues and/or parts can 
be achieved, making recovery from those tissues more economical. For 
example, the desired PUFAs can be expressed in seed; methods of isolating 
seed oils are well established. In addition to providing a source for purification 
of desired PUFAs, seed oil components can be manipulated through expression 
of desaturase genes, either alone or in combination with other genes such as 
elongases, to provide seed oils having a particular PUFA profile in concentrated 
form. The concentrated seed oils then can be added to animal milks and/or 
synthetic or semi-synthetic milks to serve as infant formulas where human 

-11- 



BNSDOCID: <WO 9846764A1> 



15 



20 



25 



30 



WO 98/46764 PCT/US98/07421 



nursing is impossible or undesired, or in cases of malnourishment or disease in 
both adults and infants. 

For production of PUFAs, depending upon the host cell, the availability 
of substrate, and the desired end produces), several polypeptides, particularly 
5 desaturases, are of interest including those polypeptides which catalyze the 
conversion of stearic acid to oleic acid, LA to GLA, of ALA to SDA, of oleic 
acid to LA, or of LA to ALA, which includes enzymes which desaturate at the 
A6, A9, A 12, A15 or ©3 positions. Considerations for choosing a specific 
polypeptide having desaturase activity include the pH optimum of the 

1 0 polypeptide, whether the polypeptide is a rate limiting enzyme or a component 
thereof, whether the desaturase used is essential for synthesis of a desired poly- 
unsaturated fatty acid, and/or co-factors required by the polypeptide. The 
expressed polypeptide preferably has parameters compatible with the 
biochemical environment of its location in the host cell. For example, the 

1 5 polypeptide may have to compete for substrate with other enzymes in the host 
cell. Analyses of the K m and specific activity of the polypeptide in question 
therefore are considered in determining the suitability of a given polypeptide for 
- modifying PUFA production in a given host cell. The polypeptide used in a 
particular situation therefore is one which can function under the conditions 

20 present in the intended host cell but otherwise can be any polypeptide having 
desaturase activity which has the desired characteristic of being capable of 
modifying the relative production of a desired PUFA. A scheme for the 
synthesis of arachidonic acid (20:4 A5, 8, 1 1, 14) from palmitic acid (C, 6 ) is 
shown in Figure 1 . A key enzyme in this pathway is a A5-desaturase which 

25 converts DH-y-linolenic acid (DGLA, eicosatrienoic acid) to ARA. Conversion 
of a-linolenic acid (ALA) to stearidonic acid by a A6-desaturase is also shown. 
Production of PUFAs in addition to ARA, including EPA and DHA is shown in 
Figure 2. A key enzyme in the synthesis of arachidonic acid (20:4 A5, 8, 1 1, 
14) from stearic acid (C| 8 ) is a A6-desaturase which converts the linoleic acid 

30 into y-linolenic acid. Conversion of a-linolenic acid (ALA) to stearidonic acid 
by a A6-desaturase also is shown. For production of ARA, the DNA sequence 
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used encodes a polypeptide having A5 desaturase activity. In particular 
instances, this can be coupled with an expression cassette which provides for 
production of a polypeptide having A6 desaturase activity and, optionally, a 
transcription cassette providing for production of antisense sequences to a A 15 
5 transcription product. The choice of combination of cassettes used depends in 
part on the PUFA profile of the host cell. Where the host cell A5-desaturase 
activity is limiting, overexpression of A5 desaturase alone generally will be 
sufficient to provide for enhanced ARA production. 

SOURCES OF POLYPEPTIDES 
1 0 HAVING DESATURASE ACTIVITY 

As sources of polypeptides having desaturase activity and 

oligonucleotides encoding such polypeptides are organisms which produce a 

desired poly-unsaturated fatty acid. As an example, microorganisms having an 

ability to produce ARA can be used as a source of A5-desaturase genes; 

1 5 microorganisms which GLA or SD A can be used as a source of A6-desaturase 
and/or A12-desaturase genes. Such microorganisms include, for example, those 
belonging to the genera Mortierella, Conidiobolus, Pythium, Phytophathora, 
Penicillium, Porphyridium, Coidosporium, Mucor, Fusarium, Aspergillus, 
Rhodotorula, and Entomophthora. Within the genus Porphyridium, of 

20 particular interest is Porphyridium cruentum. Within the genus Mortierella, of 
particular interest are Mortierella elongata, Mortierella exigua, Mortierella 
hygrophila, Mortierella ramanniana, var. angulispora, and Mortierella alpina. 
Within the genus Mucor, of particular interest are Mucor circinelloides and 
Mucor javanicus. 

25 DNAs encoding desired desaturases can be identified in a variety of 

ways. As an example, a source of the desired desaturase, for example genomic 
or cDNA libraries from Mortierella, is screened with detectable enzymatically- 
or chemically-synthesized probes, which can be made from DNA, RNA, or non- 
naturally occurring nucleotides, or mixtures thereof. Probes may be 

30 enzymatically synthesized from DNAs of known desaturases for normal or 
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reduced-stringency hybridization methods. Oligonucleotide probes also can be 
used to screen sources and can be based on sequences of known desaturases, 
including sequences conserved among known desaturases, or on peptide 
sequences obtained from the desired purified protein. Oligonucleotide probes 
5 based on amino acid sequences can be degenerate to encompass the degeneracy 
of the genetic code, or can be biased in favor of the preferred codons of the 
source organism. Oligonucleotides also can be used as primers for PCR from 
reverse transcribed mRNA from a known or suspected source; the PCR product 
can be the full length cDNA or can be used to generate a probe to obtain the 
1 0 desired full length cDNA. Alternatively, a desired protein can be entirely 

sequenced and total synthesis of a DNA encoding that polypeptide performed. 

Once the desired genomic or cDNA has been isolated, it can be 
sequenced by known methods. It is recognized in the art that such methods are 
subject to errors, such that multiple sequencing of the same region is routine and 

15 is still expected to lead to measurable rates of mistakes in the resulting deduced 
sequence, particularly in regions having repeated domains, extensive secondary 
structure, or unusual base compositions, such as regions with high GC base 
content. When discrepancies arise, resequencing can be done and can employ 
special methods. Special methods can include altering sequencing conditions 

20 by using: different temperatures; different enzymes; proteins which alter the 
ability of oligonucleotides to form higher order structures; altered nucleotides 
such as ITP or methylated dGTP; different gel compositions, for example 
adding formamide; different primers or primers located at different distances 
from the problem region; or different templates such as single stranded DNAs. 

25 Sequencing of mRNA can also be employed. 

For the most part, some or all of the coding sequence for the polypeptide 
having desaturase activity is from a natural source. In some situations, 
however, it is desirable to modify all or a portion of the codons, for example, to 
enhance expression, by employing host preferred codons. Host preferred 
30 codons can be determined from the codons of highest frequency in the proteins 
expressed in the largest amount in a particular host species of interest. Thus, the 
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coding sequence for a polypeptide having desaturase activity can be 
synthesized in whole or in part. All or portions of the DNA also can be 
synthesized to remove any destabilizing sequences or regions of secondary 
structure which would be present in the transcribed mRNA. All or portions of 
5 the DNA also can be synthesized to alter the base composition to one more 
preferable in the desired host cell. Methods for synthesizing sequences and 
bringing sequences together are well established in the literature. In vitro 
mutagenesis and selection, site-directed mutagenesis, or other means can be 
employed to obtain mutations of naturally occurring desaturase genes to 
10 produce a polypeptide having desaturase activity in vivo with more desirable 
physical and kinetic parameters for function in the host cell, such as a longer 
half-life or a higher rate of production of a desired polyunsaturated fatty acid. 

Desirable cDNAs have less than 60% A+T composition, preferably less 
than 50% A+T composition. On a localized scale of a sliding window of 20 
1 5 base pairs, it is preferable that there are no localized regions of the cDNA with 
greater than 75% A+T composition; with a window of 60 base pairs, it is 
preferable that there are no localized regions of the cDNA with greater than 
60%, more preferably no localized regions with greater than 55% A+T 
composition. 

20 Mortierella giving Desaturases 

Of particular interest are the Mortierella alpina A5-desaturase, A6- 
desaturase and A12-desaturase. The A5-desaturase has 446 amino acids; the 
amino acid sequence is shown in Figure 7. The gene encoding the Mortierella 
alpina A5-desaturase can be expressed in transgenic microorganisms to effect 

25 greater synthesis of ARA from DGLA. Other DNAs which are substantially 
identical in sequence to the Mortierella alpina A5-desaturase DNA, or which 
encode polypeptides which are substantially identical in sequence to the 
Mortierella alpina A5-desaturase polypeptide, also can be used. The 
Mortierella alpina A6-desaturase, has 457 amino acids and a predicted 

30 molecular weight of 5 1 .8 kD; the amino acid sequence is shown in Figure 3. 
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The gene encoding the Mortierella alpina A6-desaturase can be expressed in 
transgenic plants or animals to effect greater synthesis of GLA from linoleic 
acid or of stearidonic acid (SD A) from ALA. Other DN As which are 
substantially identical in sequence to the Mortierella alpina A6-desaturase 
5 DNA, or which encode polypeptides which are substantially identical in 

sequence to the Mortierella alpina A6-desaturase polypeptide, also can be used. 

The Mortierella alpina A12-desaturase has the amino acid sequence 
shown in Figure 5. The gene encoding the Mortierella alpina A12-desaturase 
can be expressed in transgenic plants to effect greater synthesis of LA from 
10 oleic acid. Other DNAs which are substantially identical to the Mortierella 
alpina A12-desaturase DNA, or which encode polypeptides which are 
substantially identical to the Mortierella alpina A12-desaturase polypeptide, 
also can be used. 

By substantially identical in sequence is intended an amino acid 

1 5 sequence or nucleic acid sequence exhibiting in order of increasing preference 
at least 60%, 80%, 90% or 95% homology to the Mortierella alpina A5- 
desaturase amino acid sequence or nucleic acid sequence encoding the amino 
acid sequence. For polypeptides, the length of comparison sequences generally 
is at least 16 amino acids, preferably at least 20 amino acids, or most preferably 

20 35 amino acids. For nucleic acids, the length of comparison sequences 

generally is at least 50 nucleotides, preferably at least 60 nucleotides, and more 
preferably at least 75 nucleotides, and most preferably, 110 nucleotides. 
Homology typically is measured using sequence analysis software, for example, 
the Sequence Analysis software package of the Genetics Computer Group, 

25 University of Wisconsin Biotechnology Center, 1710 University Avenue, 
Madison, Wisconsin 53705, MEGAlign (DNAStar, Inc., 1228 S. Park St., 
Madison, Wisconsin 53715), and MacVector (Oxford Molecular Group, 2105 S. 
Bascom Avenue, Suite 200, Campbell, California 95008). Such software 
matches similar sequences by assigning degrees of homology to various 

30 substitutions, deletions, and other modifications. Conservative substitutions 

typically include substitutions within the following groups: glycine and alanine; 
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valine, isoleucine and leucine; aspartic acid, glutamic acid, asparagine, and 
glutamine; serine and threonine; lysine and arginine; and phenylalanine and 
tyrosine. Substitutions may also be made on the basis of conserved 
hydrophobic^ or hydrophilicity (Kyte and Doolittle, J. Mol Biol 157: 105- 
5 132, 1982), or on the basis of the ability to assume similar polypeptide 
secondary structure (Chou and Fasman, Adv. Enzymol 47: 45-148, 1978). 

Other Desaturases 
Encompassed by the present invention are related desaturases from the 
same or other organisms. Such related desaturases include variants of the 

10 disclosed A5-, A6- and A12-desaturases that occur naturally within the same or 
different species of Mortierella, as well as homologues of the disclosed A5- 
desaturase from other species and evolutionarily related protein having 
desaturase activity. Also included are desaturases which, although not 
substantially identical to the Mortierella alpina A5-desaturase, desaturate a fatty 

15 acid molecule at carbon 5, 6 or 12, respectively, from the carboxyl end of a fatty 
acid molecule. Related desaturases can be identified by their ability to function 
substantially the same as the disclosed desaturases; that is, are still able to 
effectively convert DGLA to ARA, LA to GLA, ALA to SDA or oleic acid to 
LA. Related desaturases also can be identified by screening sequence databases 

20 for sequences homologous to the disclosed desaturase, by hybridization of a 

probe based on the disclosed desaturase to a library constructed from the source 
organism, or by RT-PCR using mRNA from the source organism and primers 
based on the disclosed desaturase. Such desaturases includes those from 
humans, Dictyostelium discoideum and Phaeodactylum tricornum. 

25 The regions of a desaturase polypeptide important for desaturase activity 

can be determined through routine mutagenesis, expression of the resulting 
mutant polypeptides and determination of their activities. Mutants may include 
deletions, insertions and point mutations, or combinations thereof. A typical 
functional analysis begins with deletion mutagenesis to determine the N- and C- 

30 terminal limits of the protein necessary for function, and then internal deletions, 
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insertions or point mutants are made to further determine regions necessary for 
function. Other techniques such as cassette mutagenesis or total synthesis also 
can be used. Deletion mutagenesis is accomplished, for example, by using 
exonucleases to sequentially remove the 5' or 3' coding regions. Kits are 
5 available for such techniques. After deletion, the coding region is completed by 
ligating oligonucleotides containing start or stop codons to the deleted coding 
region after 5' or 3' deletion, respectively. Alternatively, oligonucleotides 
encoding start or stop codons are inserted into the coding region by a variety of 
methods including site-directed mutagenesis, mutagenic PCR or by ligation 

10 onto DNA digested at existing restriction sites. Internal deletions can similarly 
be made through a variety of methods including the use of existing restriction 
sites in the DNA, by use of mutagenic primers via site directed mutagenesis or 
mutagenic PCR. Insertions are made through methods such as linker-scanning 
mutagenesis, site-directed mutagenesis or mutagenic PCR. Point mutations are 

1 5 made through techniques such as site-directed mutagenesis or mutagenic PCR. 

Chemical mutagenesis can also be used for identifying regions of a 
desaturase polypeptide important for activity. A mutated construct is expressed, 
and the ability of the resulting altered protein to function as a desaturase is 
assayed. Such structure-function analysis can determine which regions may be 
20 deleted, which regions tolerate insertions, and which point mutations allow the 
mutant protein to function in substantially the same way as the native 
desaturase. All such mutant proteins and nucleotide sequences encoding them 
are within the scope of the present invention. 

EXPRESSION OF DESATURASE GENES 
25 Once the DNA encoding a desaturase polypeptide has been obtained, it 

is placed in a vector capable of replication in a host cell, or is propagated in 
vitro by means of techniques such as PCR or long PCR. Replicating vectors 
can include plasmids, phage, viruses, cosmids and the like. Desirable vectors 
include those useful for mutagenesis of the gene of interest or for expression of 
30 the gene of interest in host cells. The technique of long PCR has made in vitro 
propagation of large constructs possible, so that modifications to the gene of 
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interest, such as mutagenesis or addition of expression signals, and propagation 
of the resulting constructs can occur entirely in vitro without the use of a 
replicating vector or a host cell. 

For expression of a desaturase polypeptide, functional transcriptional 
5 and translational initiation and termination regions are operably linked to the 
DNA encoding the desaturase polypeptide. Transcriptional and translational 
initiation and termination regions are derived from a variety of nonexclusive 
sources, including the DNA to be expressed, genes known or suspected to be 
capable of expression in the desired system, expression vectors, chemical 

1 0 synthesis, or from an endogenous locus in a host cell. Expression in a plant 
tissue and/or plant part presents certain efficiencies, particularly where the 
tissue or part is one which is easily harvested, such as seed, leaves, fruits, 
flowers, roots, etc. Expression can be targeted to that location within the plant 
by using specific regulatory sequences, such as those of USPN 5,463,174, 

15 USPN 4,943,674, USPN 5,106,739, USPN 5,175,095, USPN 5,420,034, USPN 
5,188,958, and USPN 5,589,379. Alternatively, the expressed protein can be an 
enzyme which produces a product which may be incorporated, either directly or 
upon further modifications, into a fluid fraction from the host plant. In the 
present case, expression of desaturase genes, or antisense desaturase transcripts, 

20 can alter the levels of specific PUF As, or derivatives thereof, found in plant 
parts and/or plant tissues. The A5-desaturase polypeptide coding region is 
expressed either by itself or with other genes, in order to produce tissues and/or 
plant parts containing higher proportions of desired PUFAs or in which the 
PUF A composition more closely resembles that of human breast milk (Prieto et 

25 al , PCT publication WO 95/24494). The termination region can be derived 
from the y region of the gene from which the initiation region was obtained or 
from a different gene. A large number of termination regions are known to and 
have been found to be satisfactory in a variety of hosts from the same and 
different genera and species. The termination region usually is selected more as 

30 a matter of convenience rather than because of any particular property. 
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The choice of a host cell is influenced in part by the desired PUFA 
profile of the transgenic cell, and the native profile of the host cell. As an 
example, for production of linoleic acid from oleic acid, the DNA sequence 
used encodes a polypeptide having A12 desaturase activity, and for production 
5 of GLA from linoleic acid, the DNA sequence used encodes a polypeptide 
having A6 desaturase activity. Use of a host cell which expresses A12 
desaturase activity and lacks or is depleted in A15 desaturase activity, can be 
used with an expression cassette which provides for overexpression of A6 
desaturase alone generally is sufficient to provide for enhanced GLA production 

10 in the transgenic cell. Where the host cell expresses A9 desaturase activity, 
expression of both a Al 2- and a A6-desaturase can provide for enhanced GLA 
production. In particular instances where expression of A6 desaturase activity is 
coupled with expression of A12 desaturase activity, it is desirable that the host 
cell naturally have, or be mutated to have, low A15 desaturase activity. 

1 5 Alternatively, a host cell for A6 desaturase expression may have, or be mutated 
to have, high A12 desaturase activity. 

Expression in a host cell can be accomplished in a transient or stable 
fashion. Transient expression can occur from introduced constructs which 
contain expression signals functional in the host cell, but which constructs do 
20 not replicate and rarely integrate in the host cell, or where the host cell is not 
proliferating. Transient expression also can be accomplished by inducing the 
activity of a regulatable promoter operably linked to the gene of interest, 
although such inducible systems frequently exhibit a low basal level of 
expression. Stable expression can be achieved by introduction of a construct 
that can integrate into the host genome or that autonomously replicates in the 
host cell. Stable expression of the gene of interest can be selected for through 
the use of a selectable marker located on or transfected with the expression 
construct, followed by selection for cells expressing the marker. When stable 
expression results from integration, integration of constructs can occur 
30 randomly within the host genome or can be targeted through the use of 

constructs containing regions of homology with the host genome sufficient to 
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target recombination with the host locus. Where constructs are targeted to an 
endogenous locus, all or some of the transcriptional and translational regulatory 
regions can be provided by the endogenous locus. 

When increased expression of the desaturase polypeptide in the source 
5 plant is desired, several methods can be employed. Additional genes encoding 
the desaturase polypeptide can be introduced into the host organism. 
Expression from the native desaturase locus also can be increased through 
homologous recombination, for example by inserting a stronger promoter into 
the host genome to cause increased expression, by removing destabilizing 
1 0 sequences from either the mRNA or the encoded protein by deleting that 
information from the host genome, or by adding stabilizing sequences to the 
mRNA (see USPN 4,910,141 and USPN 5,500,365.) 

When it is desirable to express more than one different gene, appropriate 
regulatory regions and expression methods, introduced genes can be propagated 

15 in the host cell through use of replicating vectors or by integration into the host 
genome. Where two or more genes are expressed from separate replicating 
vectors, it is desirable that each vector has a different means of replication. 
Each introduced construct, whether integrated or not, should have a different 
means of selection and should lack homology to the other constructs to maintain 

20 stable expression and prevent reassortment of elements among constructs. 
Judicious choices of regulatory regions, selection means and method of 
propagation of the introduced construct can be experimentally determined so 
that all introduced genes are expressed at the necessary levels to provide for 
synthesis of the desired products. 

25 Constructs comprising the gene of interest may be introduced into a host 

cell by standard techniques. These techniques include transfection, infection, 
holistic impact, electroporation, microinjection, scraping, or any other method 
which introduces the gene of interest into the host cell (see USPN 4,743,548, 
USPN 4,795,855, USPN 5,068,193, USPN 5,188,958, USPN 5,463,174, USPN 

30 5,565,346 and USPN 5,565,347). For convenience, a host cell which has been 
manipulated by any method to take up a DNA sequence or construct will be 
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referred to as "transformed" or "recombinant" herein. The subject host will 
have at least have one copy of the expression construct and may have two or 
more, depending upon whether the gene is integrated into the genome, 
amplified, or is present on an extrachromosomal element having multiple copy 
numbers. 

The transformed host cell can be identified by selection for a marker 
contained on the introduced construct. Alternatively, a separate marker 
construct may be introduced with the desired construct, as many transformation 
techniques introduce many DNA molecules into host cells. Typically 
transformed hosts are selected for their ability to grow on selective media 
Selective media may incorporate an antibiotic or lack a factor necessary for 
growth of the untransformed host, such as a nutrient or growth factor. An 
mtroduced marker gene therefor may confer antibiotic resistance, or encode an 
essential growth factor or enzyme, and permit growth on selective media when 
expressed in the transformed host cell. Desirably, resistance to kanamycin and 
the amino glycoside G418 are of interest (see USPN 5,034,322). Selection of a 
transformed host can also occur when the expressed marker protein can be 
detected, either directly or indirectly. The marker protein may be expressed 
alone or as a fusion to another protein. The marker protein can be detected by 
its enzymatic activity; for example 0 galactosidase can convert the substrate X- 
gal to a colored product, and luciferase can convert luciferin to a light-emitting 
product. The marker protein can be detected by its light-producing or 
modxfying characteristics; for example, the green fluorescent protein of 
Ae q uorea victoria fluoresces when illuminated with blue light. Antibodies can 
be used to detect the marker protein or a molecular tag on, for example, a 
protein of interest. Cells expressing the marker protein or tag can be selected 
for example, visually, or by techniques such as FACS or panning using 
antibodies. 

11* PUFAs produced using the subject methods and compositions may 
be found ,„ the host p,am tissue and/or p la „, pa* as free ( m acids or in 
conjugated forms such as ac ylgly cerols, phospholipids, sulfohpids or 
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glycolipids, and may be extracted from the host cell through a variety of means 
well-known in the art. Such means may include extraction with organic 
solvents, sonication, supercritical fluid extraction using for example carbon 
dioxide, and physical means such as presses, or combinations thereof. Of 
5 particular interest is extraction with hexane or methanol and chloroform. Where 
desirable, the aqueous layer can be acidified to protonate negatively charged 
moieties and thereby increase partitioning of desired products into the organic 
layer. After extraction, the organic solvents can be removed by evaporation 
under a stream of nitrogen. When isolated in conjugated forms, the products are 
1 0 enzymatically or chemically cleaved to release the free fatty acid or a less 

complex conjugate of interest, and are then subjected to further manipulations to 
produce a desired end product. Desirably, conjugated forms of fatty acids are 
cleaved with potassium hydroxide. 

PURIFICATION OF FATTY ACIDS 
1 5 If further purification is necessary, standard methods can be employed. 

Such methods include extraction, treatment with urea, fractional crystallization, 
HPLC, fractional distillation, silica gel chromatography, high speed 
centrifugation or distillation, or combinations of these techniques. Protection of 
reactive groups, such as the acid or alkenyl groups, may be done at any step 
20 through known techniques, for example alkylation or iodination. Methods used 
include methylation of the fatty acids to produce methyl esters. Similarly, 
protecting groups may be removed at any step. Desirably, purification of 
fractions containing ARA, DHA and EPA is accomplished by treatment with 
urea and/or fractional distillation. 

25 USES OF FATTY ACIDS 

The uses of the fatty acids of subject invention are several. Probes based 
on the DNAs of the present invention may find use in methods for isolating 
related molecules or in methods to detect organisms expressing desaturases. 
When used as probes, the DNAs or oligonucleotides need to be detectable. This 

30 is usually accomplished by attaching a label either at an internal site, for 
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example via incorporation of a modified residue, or at the 5' or 3' terminus. 
Such labels can be directly detectable, can bind to a secondary molecule that is 
detectably labeled, or can bind to an unlabeled secondary molecule and a 
detectably labeled tertiary molecule; this process can be extended as long as is 
5 practical to achieve a satisfactorily detectable signal without unacceptable levels 
of background signal. Secondary, tertiary, or bridging systems can include use 
of antibodies directed against any other molecule, including labels or other 
antibodies, or can involve any molecules which bind to each other, for example 
a biotin-streptavidin/avidin system. Detectable labels typically include 

1 0 radioactive isotopes, molecules which chemically or enzymatically produce or 
alter light, enzymes which produce detectable reaction products, magnetic 
molecules, fluorescent molecules or molecules whose fluorescence or light- 
emitting characteristics change upon binding. Examples of labelling methods 
can be found in USPN 5,01 1,770. Alternatively, the binding of target molecules 

15 can be directly detected by measuring the change in heat of solution on binding 
of probe to target via isothermal titration calorimetry, or by coating the probe or 
target on a surface and detecting the change in scattering of light from the 
surface produced by binding of target or probe, respectively, as may be done 
with the BIAcore system. 

20 PUFAs of the subject invention produced by recombinant means find 

applications in a wide variety of areas. Supplementation of humans or animals 
with PUFAs in various forms can result in increased levels not only of the 
added PUFAs, but of their metabolic progeny as well. For example, where the 
inherent A6-desaturase pathway is dysfunctional in an individual, treatment with 

25 GL A can result not only in increased levels of GLA, but also of downstream 
products such as ARA and prostaglandins (see Figure 1). Complex regulatory 
mechanisms can make it desirable to combine various PUFAs, or to add 
different conjugates of PUFAs, in order to prevent, control or overcome such 
mechanisms to achieve the desired levels of specific PUFAs in an individual. 

30 PUFAs, or derivatives thereof, made by the disclosed method can be 

used as dietary supplements, particularly in infant formulas, for patients 
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undergoing intravenous feeding or for preventing or treating malnutrition. 
Particular fatty acids such as EPA are used to alter the composition of infant 
formulas to better replicate the PUFA composition of human breast milk. The 
predominant triglyceride in human milk has been reported to be l,3-di-oleoyl-2- 
palmitoyl, with 2-palmitoyl glycerides reported as better absorbed than 2-oleoyl 
or 2-lineoyl glycerides (USPN 4,876,107). Typically, human breast milk has a 
fatty acid profile comprising from about 0. 1 5 % to about 0.36 % as DHA, from 
about 0.03 % to about 0.13 % as EPA, from about 0.30 % to about 0.88 % as 
ARA, from about 0.22 % to about 0.67 % as DGLA, and from about 0.27 % to 
about 1.04 % as GLA. A preferred ratio of GLA:DGLA:ARA in infant 
formulas is from about 1 : 1 :4 to about 1:1:1, respectively. Amounts of oils 
providing these ratios of PUFA can be determined without undue 
experimentation by one of skill in the art. PUFAs, or host cells containing 
them, also can be used as animal food supplements to alter an animal's tissue or 
1 5 milk fatty acid composition to one more desirable for human or animal 
consumption. 

NUTRITIONAL COMPOSITIONS 

The present invention also includes nutritional compositions. Such 
compositions, for purposes of the present invention, include any food or 
20 preparation for human consumption including for enteral or parenteral 

consumption, which when taken into the body (a) serve to nourish or build up 
tissues or supply energy and/or (b) maintain, restore or support adequate 
nutritional status or metabolic function. 

The nutritional composition of the present invention comprises at least 
one oil or acid produced in accordance with the present invention and may 
either be in a solid or liquid form. Additionally, the composition may include 
edible macronutrients, vitamins and minerals in amounts desired for a particular 
use. The amount of such ingredients will vary depending on whether the 
composition is intended for use with normal, healthy infants, children or adults 
having specialized needs such as those which accompany certain metabolic 
conditions (e.g., metabolic disorders). 
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Examples of macronutrients which may be added to the composition 
include but are not limited to edible fats, carbohydrates and proteins. Examples 
of such edible fats include but are not limited to coconut oil, soy oil, and mono- 
and diglycerides. Examples of such carbohydrates include but are not limited to 
5 glucose, edible lactose and hydrolyzed search. Additionally, examples of 
proteins which may be utilized in the nutritional composition of the invention 
include but are not limited to soy proteins, electrodialysed whey , 
electrodialysed skim milk, milk whey, or the hydrolysates of these proteins. 

With respect to vitamins and minerals, the following may be added to 
10 the nutritional compositions of the present invention: calcium, phosphorus, 
potassium, sodium, chloride, magnesium, manganese, iron, copper, zinc, 
selenium, iodine, and Vitamins A, E, D, C, and the B complex. Other such 
vitamins and minerals may also be added. 

The components utilized in the nutritional compositions of the present 
15 invention will of semi-purified or purified origin. By semi-purified or purified 
is meant a material which has been prepared by purification of a natural 
material or by synthesis. 

Examples of nutritional compositions of the present invention include 
but are not limited to infant formulas, dietary supplements, and rehydration 
20 compositions. Nutritional compositions of particular interest include but are not 
limited to those utilized for enteral and parenteral supplementation for infants, 
specialist infant formulae, supplements for the elderly, and supplements for 
those with gastrointestinal difficulties and/or malabsorption. 

Nutritional Compositions 

25 A typical nutritional composition of the present invention will contain 

edible macronutrients, vitamins and minerals in amounts desired for a particular 
use. The amounts of such ingredients will vary depending on whether the 
formulation is intended for use with normal, healthy individuals temporarily 
exposed to stress, or to subjects having specialized needs due to certain chronic 

30 or acute disease states (e.g., metabolic disorders). It will be understood by 
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persons skilled in the art that the components utilized in a nutritional 
formulation of the present invention are of semi-purified or purified origin. By 
semi-purified or purified is meant a material that has been prepared by 
purification of a natural material or by synthesis. These techniques are well 
5 known in the art (See, e.g., Code of Federal Regulations for Food Ingredients 
and Food Processing; Recommended Dietary Allowances, 10 th Ed., National 
Academy Press, Washington, D.C., 1989). 

In a preferred embodiment, a nutritional formulation of the present 
invention is an enteral nutritional product, more preferably an adult or child 
10 enteral nutritional product. Accordingly in a further aspect of the invention, a 
nutritional formulation is provided that is suitable for feeding adults or children 
who are experiencing stress. The formula comprises, in addition to the PUFAs 
of the invention; macronutrients, vitamins and minerals in amounts designed to 
provide the daily nutritional requirements of adults. 

!5 The macronutritional components include edible fats, carbohydrates and 

proteins. Exemplary edible fats are coconut oil, soy oil, and mono- and 
diglycerides and the PUFA oils of this invention. Exemplary carbohydrates are 
glucose, edible lactose and hydrolyzed cornstarch. A typical protein source 
would be soy protein, electrodialysed whey or electrodialysed skim milk or milk 

20 whey, or the hydrolysates of these proteins, although other protein sources are 
also available and may be used. These macronutrients would be added in the 
form of commonly accepted nutritional compounds in amount equivalent to 
those present in human milk or an energy basis, i.e., on a per calorie basis. 

Methods for formulating liquid and enteral nutritional formulas are well 
25 known in the art and are described in detail in the examples. 

The enteral formula can be sterilized and subsequently utilized on a 
ready-to-feed (RTF) basis or stored in a concentrated liquid or a powder. The 
powder can be prepared by spray drying the enteral formula prepared as 
indicated above, and the formula can be reconstituted by rehydrating the 
30 concentrate. Adult and infant nutritional formulas are well known in the art and 
commercially available (e.g., Similac®, Ensure®, Jevity® and Alimentum® 
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from Ross Products Division, Abbott Laboratories). An oil or acid of the 
present invention can be added to any of these formulas in the amounts 
described below. 

The energy density of the nutritional composition when in liquid form, 
can typically range from about 0.6 Kcal to 3 Kcal per ml. When in solid or 
powdered form, the nutritional supplement can contain from about 1.2 to more 
than 9 Kcals per gm, preferably 3 to 7 Kcals per gm. In general, the osmolality 
of a liquid product should be less than 700 mOsm and more preferably less than 
660 mOsm. 

The nutritional formula would typically include vitamins and minerals, 
in addition to the PUFAs of the invention, in order to help the individual ingest 
the minimum daily requirements for these substances. In addition to the PUFAs 
listed above, it may also be desirable to supplement the nutritional composition 
with zinc, copper, and folic acid in addition to antioxidants. It is believed that 
these substances will also provide a boost to the stressed immune system and 
thus will provide further benefits to the individual. The presence of zinc, 
copper or folic acid is optional and is not required in order to gain the beneficial 
effects on immune suppression. Likewise a pharmaceutical composition can be 
supplemented with these same substances as well. 

In a more preferred embodiment, the nutritional contains, in addition to 
the antioxidant system and the PUFA component, a source of carbohydrate 
wherein at least 5 weight % of said carbohydrate is an indigestible 
oligosaccharide. In yet a more preferred embodiment, the nutritional 
composition additionally contains protein, taurine and carnitine. 

The PUFAs, or derivatives thereof, made by the disclosed method can 
be used as dietary substitutes, or supplements, particularly infant formulas, for 
patients undergoing intravenous feeding or for preventing or treating 
malnutrition. Typically, human breast milk has a fatty acid profile comprising 
from about 0.15 % to about 0.36 % as DHA, from about 0.03 % to about 0.13 % 
as EPA, from about 0.30 % to about 0.88 % as ARA, from about 0.22 % to 
about 0.67 % as DGLA, and from about 0.27 % to about 1 .04 »/„ as GLA 
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Additionally, the predominant triglyceride in human milk has been reported to 
be l,3-di-oleoyl-2-palmitoyl, with 2-palmitoyl glycerides reported as better 
absorbed than 2-oleoyl or 2-lineoyl glycerides (USPN 4,876,107). Thus, fatty 
acids such as ARA, DGLA, GLA and/or EPA produced by the invention can be 
5 used to alter the composition of infant formulas to better replicate the PUFA 
composition of human breast milk. In particular, an oil composition for use in a 
pharmacologic or food supplement, particularly a breast milk substitute or 
supplement, will preferably comprise one or more of ARA, DGLA and GLA. 
More preferably the oil will comprise from about 0.3 to 30% ARA, from about 
1 0 0.2 to 30% DGLA, and from about 0.2 to about 30% GLA. 

In addition to the concentration, the ratios of ARA, DGLA and GLA can 
be adapted for a particular given end use. When formulated as a breast milk 
supplement or substitute, an oil composition which contains two or more of 
ARA, DGLA and GLA will be provided in a ratio of about 1:19:30 to about 

15 6:1 :0.2, respectively. For example, the breast milk of animals can vary in ratios 
of ARA:DGLA:DGL ranging from 1:19:30 to 6:1:0.2, which includes 
intermediate ratios which are preferably about 1:1:1, 1:2:1, 1:1:4. When 
produced together in a host cell, adjusting the rate and percent of conversion of 
a precursor substrate such as GLA and DGLA to ARA can be used to precisely 

20 control the PUFA ratios. For example, a 5% to 10% conversion rate of DGLA 
to ARA can be used to produce an ARA to DGLA ratio of about 1:19, whereas 
a conversion rate of about 75% to 80% can be used to produce an ARA to 
DGLA ratio of about 6:1. Therefore, whether in a cell culture system or in a 
host animal, regulating the timing, extent and specificity of desaturase 

25 expression as described can be used to modulate the PUFA levels and ratios. 
Depending on the expression system used, e.g., cell culture or an animal 
expressing oil(s) in its milk, the oils also can be isolated and recombined in the 
desired concentrations and ratios. Amounts of oils providing these ratios of 
PUFA can be determined following standard protocols. PUFAs, or host cells 

30 containing them, also can be used as animal food supplements to alter an 

animal's tissue or milk fatty acid composition to one more desirable for human 
or animal consumption. 
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For dietary supplementation, the purified PUFAs, or derivatives thereof, 
may be incorporated into cooking oils, fats or margarines formulated so that in 
normal use the recipient would receive the desired amount. The PUFAs may 
also be incorporated into infant formulas, nutritional supplements or other food 
5 products, and may find use as anti-inflammatory or cholesterol lowering agents. 

Pharmaceutical Compositions 
The present invention also encompasses a pharmaceutical composition 
comprising one or more of the acids and/or resulting oils produced in 
accordance with the methods described herein. More specifically, such a 

1 0 pharmaceutical composition may comprise one or more of the acids and/or oils 
as well as a standard, well-known, non-toxic pharmaceutically acceptable 
carrier, adjuvant or vehicle such as, for example, phosphate buffered saline, 
water, ethanol, polyols, vegetable oils, a wetting agent or an emulsion such as a 
water/oil emulsion. The composition may be in either a liquid or solid form. 

1 5 For example, the composition may be in the form of a tablet, capsule, ingestible 
liquid or powder, injectible, or topical ointment or cream. 

Possible routes of administration include, for example, oral, rectal and 
parenteral. The route of administration will, of course, depend upon the desired 
effect. For example, if the composition is being utilized to treat rough, dry, or 
20 aging skin, to treat injured or burned skin, or to treat skin or hair affected by a 
disease or condition, it may perhaps be applied topically. 

The dosage of the composition to be administered to the patient may be 
determined by one of ordinary skill in the art and depends upon various factors 
such as weight of the patient, age of the patient, immune status of the patient, 
25 etc. 

With respect to form, the composition may be, for example, a solution, a 
dispersion, a suspension, an emulsion or a sterile powder which is then 
reconstituted. 
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Additionally, the composition of the present invention may be utilized 
for cosmetic purposes. It may be added to pre-existing cosmetic compositions 
such that a mixture is formed or may be used as a sole composition. 

Pharmaceutical compositions may be utilized to administer the PUFA 
5 component to an individual. Suitable pharmaceutical compositions may 

comprise physiologically acceptable sterile aqueous or non-aqueous solutions, 
dispersions, suspensions or emulsions and sterile powders for reconstitution into 
sterile solutions or dispersions for ingestion. Examples of suitable aqueous and 
non-aqueous carriers, diluents, solvents or vehicles include water, ethanol, 

1 0 polyols (propy leneglycol, polyethyleneglycol, glycerol, and the like), suitable 
mixtures thereof, vegetable oils (such as olive oil) and injectable organic esters 
such as ethyl oleate. Proper fluidity can be maintained, for example, by the 
maintenance of the required particle size in the case of dispersions and by the 
use of surfactants. It may also be desirable to include isotonic agents, for 

1 5 example sugars, sodium chloride and the like. Besides such inert diluents, the 
composition can also include adjuvants, such as wetting agents, emulsifying and 
suspending agents, sweetening, flavoring and perfuming agents. 

Suspensions, in addition to the active compounds, may contain 
suspending agents, as for example, ethoxylated isostearyl alcohols, 
20 polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose, 

aluminum metahydroxide, bentonite, agar-agar and tragacanth or mixtures of 
these substances, and the like. 

Solid dosage forms such as tablets and capsules can be prepared using 
techniques well known in the art. For example, PUFAs of the invention can be 

25 tableted with conventional tablet bases such as lactose, sucrose, and cornstarch 
in combination with binders such as acacia, cornstarch or gelatin, disintegrating 
agents such as potato starch or alginic acid and a lubricant such as stearic acid 
or magnesium stearate. Capsules can be prepared by incorporating these 
excipients into a gelatin capsule along with the antioxidants and the PUFA 

30 component. The amount of the antioxidants and PUFA component that should 
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be incorporated into the pharmaceutical formulation should fit within the 
guidelines discussed above. 

As used in this application, the term "treat" refers to either preventing, or 
reducing the incidence of, the undesired occurrence. For example, to treat 
5 immune suppression refers to either preventing the occurrence of this 

suppression or reducing the amount of such suppression. The terms "patient" 
and "individual" are being used interchangeably and both refer to an animal. 
The term "animal" as used in this application refers to any warm-blooded 
mammal including, but not limited to, dogs, humans, monkeys, and apes. As 
1 0 used in the application the term "about" refers to an amount varying from the 
stated range or number by a reasonable amount depending upon the context of 
use. Any numerical number or range specified in the specification should be 
considered to be modified by the term about. 

"Dose" and "serving" are used interchangeably and refer to the amount 
15 of the nutritional or pharmaceutical composition ingested by the patient in a 
single setting and designed to deliver effective amounts of the antioxidants and 
the structured triglyceride. As will be readily apparent to those skilled in the 
art, a single dose or serving of the liquid nutritional powder should supply the 
amount of antioxidants and PUFAs discussed above. The amount of the dose or 
20 serving should be a volume that a typical adult can consume in one sitting. This 
amount can vary widely depending upon the age, weight, sex or medical 
condition of the patient. However as a general guideline, a single serving or 
dose of a liquid nutritional produce should be considered as encompassing a 
volume from 100 to 600 ml, more preferably from 125 to 500 ml and most 
25 preferably from 125 to 300 ml. 

The PUFAs of the present invention may also be added to food even 
when supplementation of the diet is not required. For example, the composition 
may be added to food of any type including but not limited to margarines, 
modified butters, cheeses, milk, yogurt, chocolate, candy, snacks, salad oils, 
30 cooking oils, cooking fats, meats, fish and beverages. 
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Pharmaceutical Applicati ns 

For pharmaceutical use (human or veterinary), the compositions are 
generally administered orally but can be administered by any route by which 
they may be successfully absorbed, e.g., parenterally (i.e. subcutaneously, 
5 intramuscularly or intravenously), rectally or vaginally or topically, for 

example, as a skin ointment or lotion. The PUFAs of the present invention may 
be administered alone or in combination with a pharmaceutically acceptable 
carrier or excipient. Where available, gelatin capsules are the preferred form of 
oral administration. Dietary supplementation as set forth above also can 

1 0 provide an oral route of administration. The unsaturated acids of the present 
invention may be administered in conjugated forms, or as salts, esters, amides 
or prodrugs of the fatty acids. Any pharmaceutically acceptable salt is 
encompassed by the present invention; especially preferred are the sodium, 
potassium or lithium salts. Also encompassed are the N-alkylpolyhydrdxamine 

1 5 salts, such as N-methyl glucamine, found in PCT publication WO 96/33 155. 
The preferred esters are the ethyl esters. As solid salts, the PUFAs also can be 
administered in tablet form. For intravenous administration, the PUFAs or 
derivatives thereof may be incorporated into commercial formulations such as 
Intralipids. The typical normal adult plasma fatty acid profile comprises 6.64 to 

20 9.46% of ARA, 1 .45 to 3 . 1 1 % of DGLA, and 0.02 to 0.08% of GLA. These 
PUFAs or their metabolic precursors can be administered, either alone or in 
mixtures with other PUFAs, to achieve a normal fatty acid profile in a patient. 
Where desired, the individual components of formulations may be individually 
provided in kit form, for single or multiple use. A typical dosage of a particular 

25 fatty acid is from 0. 1 mg to 20 g, or even 1 00 g daily, and is preferably from 1 0 
mg to 1, 2, 5 or 10 g daily as required, or molar equivalent amounts of 
derivative forms thereof. Parenteral nutrition compositions comprising from 
about 2 to about 30 weight percent fatty acids calculated as triglycerides are 
encompassed by the present invention; preferred is a composition having from 

30 about 1 to about 25 weight percent of the total PUFA composition as GLA 

(USPN 5,196,198). Other vitamins, and particularly fat-soluble vitamins such 

as vitamin A, D, E and L-carnitine can optionally be included. Where desired a 
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preservative such as a tocopherol may be added, typically at about 0.1% by 
weight. 

Suitable pharmaceutical compositions may comprise physiologically 
acceptable sterile aqueous or non-aqueous solutions, dispersions, suspensions or 
5 emulsions and sterile powders for reconstitution into sterile injectible solutions 
or dispersions. Examples of suitable aqueous and non-aqeuous carriers, 
diluents, solvents or vehicles include water, ethanol, polyols (propylleneglyol, 
polyethylenegycol, glycerol, and the like), suitable mixtures thereof, vegetable 
oils (such as olive oil) and injectable organic esters such as ehyl oleate. Proper 

1 0 fluidity can be maintained, for example, by the maintenance of the required 

particle size in the case of dispersions and by the use of surfactants. It may also 
be desirable to include isotonic agents, for example sugars, sodium chloride and 
the like. Besides such inert diluents, the composition can also include 
adjuvants, such as wetting agents, emulsifying and suspending agents, 

1 5 sweetening, flavoring and perfuming agents. 

Suspensions in addition to the active compounds, may contain 
suspending agents, as for example, ethoxylated isostearyl alcohols, 
polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose, 
aluminum metahydroxide, bentonite, agar-agar and tragacanth, or mixtures of 
20 these substances and the like. 

An especially preferred pharmaceutical composition contains 
diacetyltartaric acid esters of mono- and diglycerides dissolved in an aqueous 
medium or solvent. Diacetyltartaric acid esters of mono- and diglycerides have 
an HLB value of about 9-12 and are significantly more hydrophilic than existing 

25 antimicrobial lipids that have HLB values of 2-4. Those existing hydrophobic 
lipids cannot be formulated into aqueous compositions. As disclosed herein, 
those lipids can now be solubilized into aqueous media in combination with 
diacetyltartaric acid esters of mono-and diglycerides. In accordance with this 
embodiment, diacetyltartaric acid esters of mono- and diglycerides (e.g., 

30 DATEM-C12:0) is melted with other active antimicrobial lipids (e.g., 1 8:2 and 
12:0 monoglycerides) and mixed to obtain a homogeneous mixture. 
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Homogeneity allows for increased antimicrobial activity. The mixture can be 
completely dispersed in water. This is not possible without the addition of 
diacetyltartaric acid esters of mono- and diglycerides and premixing with other 
monoglycerides prior to introduction into water. The aqueous composition can 
5 then be admixed under sterile conditions with physiologically acceptable 

diluents, preservatives, buffers or propellants as may be required to form a spray 
or inhalant. 

The present invention also encompasses the treatment of numerous 
disorders with fatty acids. Supplementation with PUFAs of the present 

1 0 invention can be used to treat restenosis after angioplasty. Symptoms of 

inflammation, rheumatoid arthritis, and asthma and psoriasis can be treated with 
the PUFAs of the present invention. Evidence indicates that PUFAs may be 
involved in calcium metabolism, suggesting that PUFAs of the present 
invention may be used in the treatment or prevention of osteoporosis and of 

1 5 kidney or urinary tract stones. 

The PUFAs of the present invention can be used in the treatment of 
cancer. Malignant cells have been shown to have altered fatty acid 
compositions; addition of fatty acids has been shown to slow their growth and 
cause cell death, and to increase their susceptibility to chemotherapeutic agents. 

20 GLA has been shown to cause reexpression on cancer cells of the E-cadherin 
cellular adhesion molecules, loss of which is associated with aggressive 
metastasis. Clinical testing of intravenous administration of the water soluble 
lithium salt of GLA to pancreatic cancer patients produced statistically 
significant increases in their survival. PUFA supplementation may also be 

25 useful for treating cachexia associated with cancer. 

The PUFAs of the present invention can also be used to treat diabetes 
(USPN 4,826,877; Horrobin et aL, Am. J. Clin. Nutr. Vol. 57 (Suppl.), 732S- 
737S). Altered fatty acid metabolism and composition has been demonstrated 
in diabetic animals. These alterations have been suggested to be involved in 
30 some of the long-term complications resulting from diabetes, including 
retinopathy, neuropathy, nephropathy and reproductive system damage. 
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Primrose oil, which contains GLA, has been shown to prevent and reverse 
diabetic nerve damage. 

The PUFAs of the present invention can be used to treat eczema, reduce 
blood pressure and improve math scores. Essential fatty acid deficiency has 
5 been suggested as being involved in eczema, and studies have shown beneficial 
effects on eczema from treatment with GLA. GLA has also been shown to 
reduce increases in blood pressure associated with stress, and to improve 
performance on arithmetic tests. GLA and DGLA have been shown to inhibit 
platelet aggregation, cause vasodilation, lower cholesterol levels and inhibit 

1 0 proliferation of vessel wall smooth muscle and fibrous tissue (Brenner et al , 
Adv. Exp. Med. Biol. Vol. 83, p. 85-101, 1976). Administration of GLA or 
DGLA, alone or in combination with EPA, has been shown to reduce or prevent 
gastro-intestinal bleeding and other side effects caused by non-steroidal anti- 
inflammatory drugs (USPN 4,666,701). GLA and DGLA have also been shown 

1 5 to prevent or treat endometriosis and premenstrual syndrome (USPN 4,758,592) 
and to treat myalgic encephalomyelitis and chronic fatigue after viral infections 
(USPN 5,116,871). 

Further uses of the PUFAs of this invention include use in treatment of 
AIDS, multiple schlerosis, acute respiratory syndrome, hypertension and 
20 inflammatory skin disorders. The PUFAs of the inventions also can be used for 
formulas for general health as well as for geriatric treatments. 

Veterinary Applications 

It should be noted that the above-described pharmaceutical and 
nutritional compositions may be utilized in connection with animals, as well as 
25 humans, as animals experience many of the same needs and conditions as 

human. For example, the oil or acids of the present invention may be utilized in 
animal feed supplements or as animal feed substitutes. 

The following examples are presented by way of illustration, not of 
limitation. 
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Example 1 Isolation of A5 Desaturase Nucleotide Sequence from 
Mortierella alpina 

Example 2 Isolation of A6 Desaturase Nucleotide Sequence from 
Mortierella alpina 

Example 3 Identification of A6 Desaturases Homologues to the 
Mortierella alpina A Desaturase 

Example 4 Isolation of D-12 Desaturase Nucleotide Sequence from 
Mortierella alpina 

Example 5 Isolation of Cytochrome b5 Reductase Nucleotide 
Sequence from Mortierella alpina 
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Example 7 Fatty Acid Analysis of Leaves from Ma29 Transgenic 
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Example 8 Expression of M. alpina A6 Desaturase in Brassica 
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Example 9 Expression of M. alpina A 12 desaturase in Brassica 
napus 

Example 10 Simultaneous expression of M alpina A6 and A 12 
desaturases in Brassica napus 

Example 1 1 Simultaneous expression of M alpina A5 and A6 
desaturases in Brassica napus 
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Example 15 Combined Expression of A6 and A12 Desaturases in B. 
napus Achieved by Crossing 

Example 1 6 Expression of M. alpina desaturases in soybean 

Example 1 7 Human Desaturase Gene Sequences 

5 Example 1 

Isolation of a A5-d esaturase Nucleotide Sequence from Mortierella alpina 
Motierella alpina produces arachidonic acid (ARA, 20:4) from the 
precursor 20:3 by a AS-desaturase. A nucleotide sequence encoding the A5- 
desaturase from Mortierella alpina (see Figure 7) was obtained through PCR 
10 amplification using M. alpina 1 st strand cDNA and degenerate oligonucleotide 
primers corresponding to amino acid sequences conserved between A6- 
desaturases from Synechocystis and Spirulina. The procedure used was as 
follows: 

Total RNA was isolated from a 3 day old PUFA-producing culture of 
1 5 Mortierella alpina using the protocol of Hoge et al (1982) Experimental 

Mycology 6:225-232. The RNA was used to prepare double-stranded cDNA 
using BRL's lambda-ZipLox system, following the manufacturer's instructions. 
Several size fractions of the M. alpina cDNA were packaged separately to yield 
libraries with different average-sized inserts. The "full-length" library contains 
20 approximately 3 x 1 0 6 clones with an average insert size of 1 .77 kb. The 
"sequencing-grade" library contains approximately 6 x 10 5 clones with an 
average insert size of 1.1 kb. 

5*ig of total RNA was reverse transcribed using BRL Superscript RTase 
and the primer TSyn 5 * -C AAGCTTCTGC AGG AGCTCTTTTTTTTTTTTTTT- 
25 3' (SEQ ID NO:19.) Degenerate oligonucleotides were designed to regions 
conserved between the two cyanobacterial A6-desaturase sequences. The 
specific primers used were: 
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D6DESAT-F3 (SEQ ID NO:20) 

5'-CUACUACUACUACAYCAYACOTAYACOAAYAT-3' 
D6DESAT-R3 ( SEQ ID NO:21) 

5 ' -C AUC AUC AUC AUOGGRAAO ARRTGRTG-3 ' 

5 where Y=C+T, R=A+G, and 0=I+C. PCR amplification was carried out in a 
25ul volume containing: template derived from 40 ng total RNA, 2 pM each 
primer, 200 jxM each deoxyribonucleotide triphosphate, 60 mM Tris-Cl, pH 8.5, 
15 mM (NH4)2S0 4 , 2 mM MgCl 2 . Samples were subjected to an initial 
desaturation step of 95 degrees (all temperatures Celsius) for 5 minutes, then 

10 held at 72 degrees while 0.2 U of Taq polymerase were added. PCR 

thermocycling conditions were as follows: 94 degrees for 1 min., 45 degrees 
for 1 .5 min., 72 degrees for 2 min. PCR was continued for 35 cycles. PCR 
using these primers on the M. alpina first-strand cDNA produced a 550 bp 
reaction product. Comparison of the deduced amino acid sequence of the M. 

1 5 alpina PCR fragment revealed regions of homology with A6-desaturases (see 
Figure 4). However, there was only about 28% identity over the region 
compared. The deduced amino acid sequence is presented in SEQ ID NO: 14. 

The PCR product was used as a probe to isolate corresponding cDNA 
clones from a M. alpina library. The longest cDNA clone, Ma29, was 

20 designated pCGN552 1 and has been completely sequenced on both strands. 
The cDNA is contained as a 1481 bp insert in the vector pZLl (Bethesda 
Research Laboratories) and, beginning with the first ATG, contains an open 
reading frame encoding 446 amino acids. The reading frame contains the 
sequence deduced from the PCR fragment. The sequence of the cDNA insert 

25 was found to contain regions of homology to A6-desaturases (see Figure 8). For 
example, three conserved "histidine boxes" (that have been observed in other 
membrane-bound desaturases (Okuley et al, (1994) The Plant Cell 5:147-158)) 
were found to be present in the Mortierella sequence at amino acid positions 
171-175, 207-212, and 387-391 (see Figure 5A-5D). However, the typical 

30 "HXXHH" amino acid motif for the third histidine box for the Mortierella 
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desaturase was found to be QXXHH. The amino-terniinus of the encoded 
protein, showed significant homology to cytochrome b5 proteins. Thus, the 
Mortierella cDNA clone appears to represent a fusion between a cytochrome b5 
and a fatty acid desaturase. Since cytochrome b5 is believed to function as the 
5 electron donor for membrane-bound desaturase enzymes, it is possible that the 
N-terminal cytochrome b5 domain of this desaturase protein is involved in its 
function. This may be advantageous when expressing the desaturase in 
heterologous systems for PUFA production. 

Example 2 

0 Isolation of A6 Desaturase Nucle otide Sequence from Mortierella almna 
A nucleic acid sequence from a partial cDNA clone, Ma524, encoding a 
A6 fatty acid desaturase from Mortierella alpina was obtained by random 
sequencing of clones from the M. alpina cDNA library described in Example 1. 
cDNA-containing plasmids were excised as follows: 

5 Five |il of phage were combined with 100 ul of E. coli DH10B(ZIP) 

grown in ECLB plus 10 ug/ml kanamycin, 0.2% maltose, and 10 mM MgS0 4 
and incubated at 37 degrees for 15 minutes. 0.9 ml SOC was added and 100 ul 
of the bacteria immediately plated on each of 10 ECLB + 50 ug Pen plates. No 
45 minute recovery time was needed. The plates were incubated overnight at 37 

0 degrees. Colonies were picked into ECLB + 50 ug Pen media for overnight 
cultures to be used for making glycerol stocks and miniprep DNA. An aliquot 
of the culture used for the miniprep is stored as a glycerol stock. Plating on 
ECLB + 50 ug Pen/ml resulted in more colonies and a greater proportion of 
colonies containing inserts than plating on 1 00 ug/ml Pen. 

5 Random colonies were picked and plasmid DNA purified using Qiagen 

miniprep kits. DNA sequence was obtained from the 5* end of the cDNA insert 
and compared to the databases using the BLAST algorithm. Ma524 was 
identified as a putative A6 desaturase based on DNA sequence homology to 
previously identified A6 desaturases. A full-length cDNA clone was isolated 
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from the M, alpina library. The abundance of this clone appears to be slightly 
(2X) less than Ma29. Ma524 displays significant homology to a portion of a 
Caenorhabditis elegans cosmid, W06D2.4, a cytochrome b5/desaturase fusion 
protein from sunflower, and the two A6 desaturases in the public databanks 
5 those from Synechocystis and Spirulina. 

In addition, Ma524 shows significant homology to the borage A6- 
desaturase sequence (PCT publication WO 96/21022). Ma524 thus appears to 
encode a A6-desaturase that is related to the borage and algal A6-desaturases. It 
should be noted that, although the amino acid sequences of Ma524 and the 

10 borage A6 are similar, the base composition of the cDNAs is quite different: the 
borage cDNA has an overall base composition of 60 % A+T, with some regions 
exceeding 70 %, while Ma524 has an average of 44 % A+T base composition, 
with no regions exceeding 60 %. This may have implications for expressing the 
cDNAs in microorganisms or animals which favor different base compositions. 

15 It is known that poor expression of recombinant genes can occur when the host 
has a very different base composition from that of the introduced gene. 
Speculated mechanisms for such poor expression include decreased stability or 
translatability of the mRNA. 

Example 3 

20 Identifica tion of A6-desaturases Homologous 

to the Mortier ella alpina A6-desaturase 

Nucleic acid sequences that encode putative A6-desaturases were 

identified through a BLASTX search of the est databases through NCBI using 

the Ma524 amino acid sequence. Several sequences showed significant 

25 homology. In particular, the deduced amino acid sequence of two Arabidopsis 
thaliana sequences, (accession numbers F 13728 and T42806) showed 
homology to two different regions of the deduced amino acid sequence of 
Ma524. The following PCR primers were designed: ATTS4723-FOR 
(complementary to F13728) S'-CUACUACUACUAGGAGTCCTCTA 

30 CGGTGTTTTG, SEQ ID NO:22, and T42806-REV (complementary to 
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T42806) 5' CAUCAUCAUCAUATGATGCTCAAGCTGAAACTG, SEQ ID 
NO:23. Five ug of total RNA isolated from developing siliques of Arabidopsis 
thaliana was reverse transcribed using BRL Superscript RTase and the primer 
TSyn 5'-CC AAGCTTCTGCAGGAGCTC 1 U 1 1 1 1 1 U 1 IT1T -3', (SEQ ID 
5 NO:24). PCR was carried out in a 50 ul volume containing: template derived 
from 25 ng total RNA, 2 pM each primer, 200 uM each deoxyribonucleotide 
triphosphate, 60 mM Tris-Cl, pH 8.5, 15 mM (NH^SCm, 2 mM MgCl 2 , 0.2 U 
Taq Polymerase. Cycle conditions were as follows: 94 degrees for 30 sec, 50 
degrees for 30 sec, 72 degrees for 30 sec. PCR was continued for 35 cycles 

1 0 followed by an additional extension at 72 degrees for 7 minutes. PCR resulted 
in a fragment of -750 base pairs which was subsequently subcloned, named 12- 
5, and sequenced. Each end of this fragment corresponds to the Arabidopsis 
est from which the PCR primers were derived. This is the sequence named 12-5. 
The deduced amino acid sequence of 12-5 is compared to that of Ma524 and 

15 ests from human (W28140), mouse (W53753), and C. elegans (R05219) in 
Figure 4. Based on homology, these sequences represent desaturase 
polypeptides. The full-length genes can be cloned using probes based on the est 
sequences. The genes can then be placed in expression vectors and expressed in 
host cells and their specific A6- or other desaturase activity can be determined 

20 as described below. 



Example 4 

Isolation of A-12 Desaturase Nucleotid e Sequence fr 0m Mortirrrlla nl pinn 
Based on the fatty acids it accumulates, Mortierella alpina has an o>6 
type desaturase. The co6 desaturase is responsible for the production of linoleic 
acid (18:2) from oleic acid (18:1). Linoleic acid (18:2) is a substrate for a A6 
desaturase. This experiment was designed to determine if Mortierella alpina 
has a A12-desaturase polypeptide, and if so, to identify the corresponding 
nucleotide sequence. A random colony from the M. alpina sequencing grade 
library, Ma648, was sequenced and identified as a putative desaturase based on 
DNA sequence homology to previously identified desaturases, as described for 
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Ma524 (see Example 2). The deduced amino acid sequence from the 5' end of 
the Ma648 cDNA displays significant homology to soybean microsomal a>6 
(A 12) desaturase (accession #L43921) as well as castor bean oleate 12- 
hydroxylase (accession #U22378). In addition, homology is observed to a 
5 variety of other cd6 (A12) and <o3 (A15) fatty acid desaturase sequences. 

Example 5 

Isolation of Cyt ochrome bS Reductase Nucleotide Sequence 
from Mortierella alpina 

A nucleic acid sequence encoding a cytochrome b5 reductase from 

1 0 Mortierella alpina was obtained as follows. A cDNA library was constructed 

based on total RNA isolated from Mortierella alpina as described in Example 1 . 

DNA sequence was obtained from the 5* and 3' ends of one of the clones, M12- 

27. A search of public databanks with the deduced amino acid sequence of the 

3' end of M12-27 (see Figure 5) revealed significant homology to known 

1 5 cytochrome b5 reductase sequences. Specifically, over a 49 amino acid region, 

the Mortierella clone shares 55% identity (73% homology) with a cytochrome 

b5 reductase from pig (see Figure 4). 

Example 6 

Expression of M. alpina Desaturase Clones in Baker's Yeast 
20 Yeast Transformation 

Lithium acetate transformation of yeast was performed according to 
standard protocols (Methods in Enzymology, Vol. 194, p. 186-187, 1991). 
Briefly, yeast were grown in YPD at 30°C. Cells were spun down, resuspended 
in TE, spun down again, resuspended in TE containing 100 mM lithium acetate, 
25 spun down again, and resuspended in TE/lithium acetate. The resuspended 
yeast were incubated at 30°C for 60 minutes with shaking. Carrier DNA was 
added, and the yeast were aliquoted into tubes. Transforming DNA was added, 
and the tubes were incubated for 30 min. at 30°C. PEG solution (35% (w/v) 
PEG 4000, 100 mM lithium acetate, TE pH7.5) was added followed by a 50 
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min. incubation at 30°C. A 5 min. heat shock at 42°C was performed, the cells 
were pelleted, washed with TE, pelleted again and resuspended in TE. The 
resuspended cells were then plated on selective media. 

Desaturase E xpression in Transformed Yeast 

5 cDNA clones from Mortierella alpina were screened for desaturase 

activity in baker's yeast. A canola Al 5-desaturase (obtained by PCR using 1 st 
strand cDNA from Brassica napus cultivar 212/86 seeds using primers based on 
the published sequence (Arondel et al. Science 258:1353-1355)) was used as a 
positive control. The A 1 5-desaturase gene and the gene from cDNA clone 

1 0 Ma29 was put in the expression vector p YES2 (Invitrogen), resulting in 

plasmids pCGR-2 and pCGR-4, respectively. These plasmids were transfected 
into S. cerevisiae yeast strain 334 and expressed after induction with galactose 
and in the presence of substrates that allowed detection of specific desaturase 
activity. The control strain was S. cerevisiae strain 334 containing the unaltered 

1 5 pYES2 vector. The substrates used, the products produced and the indicated 
desaturase activity were: DGLA (conversion to ARA would indicate A5- 
desaturase activity), linoleic acid (conversion to GLA would indicate A6- 
desaturase activity; conversion to ALA would indicate Al 5-desaturase activity), 
oleic acid (an endogenous substrate made by S. cerevisiae, conversion to 

20 linoleic acid would indicate Al 2-desaturase activity, which S. cerevisiae lacks), 
or ARA (conversion to EPA would indicate Al 7-desaturase activity). The 
results are provided in Table 1 below. The lipid fractions were extracted as 
follows: Cultures were grown for 48-52 hours at 15°C. Cells were pelleted by 
centrifugation, washed once with sterile ddH 2 0, and repelleted. Pellets were 

25 vortexed with methanol; chloroform was added along with tritridecanoin (as an 
internal standard). The mixtures were incubated for at least one hour at room 
temperature or at 4°C overnight. The chloroform layer was extracted and 
filtered through a Whatman filter with one gram of anhydrous sodium sulfate to 
remove particulates and residual water. The organic solvents were evaporated 

30 at 40°C under a stream of nitrogen. The extracted lipids were then derivatized 
to fatty acid methyl esters (FAME) for gas chromatography analysis (GC) by 
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adding 2 ml of 0.5 N potassium hydroxide in methanol to a closed tube. The 
samples were heated to 95°C to 100°C for 30 minutes and cooled to room 
temperature. Approximately 2 ml of 14 % boron trifluoride in methanol was 
added and the heating repeated. After the extracted lipid mixture cooled, 2 ml 
5 of water and 1 ml of hexane were added to extract the FAME for analysis by 
GC. The percent conversion was calculated by dividing the product produced 
by the sum of (the product produced and the substrate added) and then 
multiplying by 100. To calculate the oleic acid percent conversion, as no 
substrate was added, the total linoleic acid produced was divided by the sum of 
10 (oleic acid and linoleic acid produced), then multiplying by 100. 
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Table 1 

M. alnina De saturase Expression in Baker's Yeast 

CLONE TYPE OF ENZYME % CONVERSION 

ACTIVITY OF SUBSTRATE 

pCGR-2 A6 0 (18:2 to 18:3co6) 

(canolaAlS A15 16.3 (18:2 to 18:3o>3) 

desaturase) A5 2.0 (20:3 to 20:4a>6) 

A17 2.8 (20:4 to 20:5(03) 

A12 1.8 (18:1 to 18:2a>6) 

pCGR-4 A6 0 

(M. alpina A15 0 

A6-like, Ma29) A5 15.3 

A17 0.3 

A12 3.3 

pCGR-7 A6 0 

(M. alpina A15 3.8 

A12-like, Ma648 A5 2.2 

A17 0 

A12 63.4 



The A15-desaturase control clone exhibited 16.3% conversion of the 
5 substrate. The pCGR-4 clone expressing the Ma29 cDNA converted 15.3% of 
the 20:3 substrate to 20:4w6, indicating that the gene encodes a A5-desaturase. 
The background (non-specific conversion of substrate) was between 0-3% in 
these cases. The pCGR-5 clone expressing the Ma524 cDNA showed 6% 
conversion of the substrate to GLA, indicating that the gene encodes a A6- 
10 desaturase. The pCGR-7 clone expressing the Ma648 cDNA converted 63.4% 
conversion of the substrate to LA, indicating that the gene encodes a A12- 
desaturase. Substrate inhibition of activity was observed by using different 
concentrations of the substrate. When substrate was added to 100 |iM, the 
percent conversion to product dropped as compared to when substrate was 
15 added to 25 \iM (see below). These data show that desaturases with different 
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substrate specificities can be expressed in a heterologous system and used to 
produce PUFAs. 

Table 2 represents fatty acids of interest as a percent of the total lipid 
extracted from the yeast host S. cerevisiae 334 with the indicated plasmid. No 
5 glucose was present in the growth media. Affinity gas chromatography was 
used to separate the respective lipids. GC/MS was employed to verify the 
identity of the produces). The expected product for the B. napus A15- 
desaturase, a-linolenic acid, was detected when its substrate, linoleic acid, was 
added exogenously to the induced yeast culture. This finding demonstrates that 

10 yeast expression of a desaturase gene can produce functional enzyme and 
detectable amounts of product under the current growth conditions. Both 
exogenously added substrates were taken up by yeast, although slightly less of 
the longer chain PUFA, dihomo-y-linolenic acid (20:3), was incorporated into 
yeast than linoleic acid (18:2) when either was added in free form to the induced 

15 yeast cultures, y-linolenic acid was detected when linoleic acid was present 
during induction and expression of S. cerevisiae 334 (pCGR-5). The presence 
of this PUFA demonstrates A6-desaturase activity from pCGR-5 (MA524). 
Linoleic acid, identified in the extracted lipids from expression of S. cerevisiae 
334 (pCGR-7), classifies the cDNA MA648 from M alpina as the A12- 

20 desaturase. 
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Example 7 

Expression of AS Desaturase in Plants 
Expression in Leaves 

This experiment was designed to determine whether leaves expressing 
5 Ma29 (as determined by Northern) were able to convert exogenously applied 
DGLA (20:3) to ARA (20:4). 

The Ma29 desaturase cDNA was modified by PCR to introduce 
convenient restriction sites for cloning. The desaturase coding region has been 
inserted into a d35 cassette under the control of the double 35S promoter for 
1 0 expression in Brassica leaves (pCGN5525) following standard protocols (see 
USPN 5,424,200 and USPN 5,106,739). Transgenic Brassica plants containing 
pCGN5525 were generated following standard protocols (see USPN 5,188,958 
and USPN 5,463,174). 

In the first experiment, three plants were used: a control, LP004-1, and 
1 5 two transgenics,, 5525-23 and 5525-29. LP004 is a low-linolenic Brassica 
variety. Leaves of each were selected for one of three treatments: water, GLA 
or DGLA. GLA and DGLA were purchased as sodium salts from NuChek Prep 
and dissolved in water at 1 mg/ml. Aliquots were capped under N 2 and stored at 
-70 degrees C. Leaves were treated by applying a 50 ul drop to the upper 
20 surface and gently spreading with a gloved finger to cover the entire surface. 
Applications were made approximately 30 minutes before the end of the light 
cycle to minimize any photo-oxidation of the applied fatty acids. After 6 days 
of treatment one leaf from each treatment was harvested and cut in half through 
the mid rib. One half was washed with water to attempt to remove 
25 unincorporated fatty acid. Leaf samples were lyophilized overnight, and fatty 
acid composition determined by gas chromatography (GC). The results are 
shown in Table 3. 



-49- 



BNSDOCID:<WO 9846764A1> 



WO 98/46764 



PCT/US98/07421 



to 



a 

! 

# o 
*5 

OJ 
en 

a 

s 

H 

•a 



s 

I 

e 
s 

<: 
s 



20:01 




o 


o 


0.01 


O 


0.01 


0.01 


O 


0.02 


o 


0.01 


0.01 


0.02 


0.01 


0.01 


0.01 


0.01 


© 


0.01 


20:00 


5? 


0.09 


0.15 


0.05 


0.05 


0.54 


0.49 


0.53 


0.50 


0.09 


0.09 


0.51 


m 
o 


0.12 


0.56 


0.51 


0.50 


0.59 


0.60 


18:04 




o 


o 


o 


o 


© 


o 


o 


o 


© 


© 


© 


© 


© 


© 


© 


© 


© 


© 


18:03 




45.52 


44.59 


49.91 


50.25 


46.29 


45.61 


43.66 


47.22 


46.55 


46.41 


46.69 


46.05 


44.62 


42.77 


45.52 


45.13 j 


43.89 


44.90 


*P 

CO 




o 


© 


© 


o 


0.08 


0.11 


1.63 


1.72 


2.12 


1.56 


2.42 


2.30 


0.07 


0.09 


0.03 


0.04 


0.04 


0.02 


18:02 




16.76 


16.86 


16.71 


16.16 


15.90 


14.54 


14.85 


15.29 


15.92 


16.66 


14.68 


15.22 


15.65 


15.96 


13.57 


13.54 


16.04 


16.07 


do 




0.98 


1.00 


0.87 


0.86 


0.86 


0.73 


0.82 


0.86 


0.82 


0.84 


0.73 


0.85 


0.91 


0.92 


0.86 


0.88 


0.67 


0.70 


18:lo 




1.54 


1.55 


1.27 




1.26 


1.35 


1.29 


1.36 


1.34 


>/-> 




1.32 


1.37 


1.58 




1.63 


1.67 


1.70 


18:01 




2.51 


2.56 


2.15 


2.07 


2.12 


2.08 


2.10 


2.22 


| 2.16 


2.35 


1.94 


2.17 


2.28 


2.50 


1.98 


2.51 


2.34 


2.41 


18:00 




2.63 


2.67 


2.37 


2.32 


2.10 


1.94 


2.37 


2.34 


oo 
VO 
<N 


2.75 


2.22 


2.20 


2.30 


2.69 


3.40 


3.60 


2.81 


2.84 


16:01 


5? 


0.08 


0.09 


0.09 


0.08 


0.11 


0.09 


0.09 


0.10 


0.07 


0.07 


0.09 


0.09 


0.14 


0.15 


0.23 


0.24 


0.07 


0.07 


16:00 




12.95 


13.00 


14.13 


13.92 


13.79 


12.80 


12.10 


12.78 


13.71 


14.10 


13.62 


13.92 


12.45 


12.67 


12.56 


13.07 


13.26 


13.53 


SPL 




*"> 








CI 


oo 


Ov 


o 




Csl 
T 


m 


Tt 
T 


■o 


VO 


r~- 


oo 


o\ 


© 


Treatment 




Water 












GLA 












DGLA 













-50- 



BNSDOCID:<WO 9846764A1> 



WO 98/46764 



PCTYUS98/07421 



a 
o 
U 
i 

<*> 

01 

H 



B 

eq 

.5 



E 

n 
o 

(A 

I 

« 
B 

s 



24:1 


5? 


0.18 


0.27 


0.25 


0.21 


0.17 


0.23 


0.17 


0.14 


0.20 


0.13 


0.14 


0.17 


0.13 


0.11 


0.20 


0.10 


0.18 


0.18 


24:0 




0.38 


0.36 


0.29 


0.28 


0.30 


5.89 


0.37 


0.36 


0.33 


0.38 


0.34 


0.33 


0.36 | 


0.41 


0.49 


0.52 


0.39 


0.37 


22:06 




© 


0.05 


0.05 


0.36 


0.20 


0.08 


0.19 


0.10 


0.29 


0.24 


0.24 


0.16 


0.21 


0.39 


0.22 


0.32 


0.23 


0.15 


22:03 




o 


0.02 


0.04 


0.03 


0.06 


0.09 


3.42 


0.05 


0.13 


0.02 


0.01 


0.05 


0.02 


0.09 


o 


0.05 


0.07 


0.04 


22:02 




16.26 


16.82 


11.29 


11.82 


15.87 


13.64 


16.25 


14.74 


13.15 


12.60 


14.73 


14.43 


18.67 


17.97 


17.96 


17.14 


17.26 


15.73 


22:01 




0.09 


0.10 


0.06 


0.04 


0.08 


0.07 


0.08 


0.10 


0.10 


0.11 


0.03 


0.07 


0.07 


0.09 


0.07 


0.09 


0.07 


0.07 


22:00 


ss 


0.01 


0.14 


0.12 


0.07 


0.18 


0.15 


0.10 


0.10 


0.20 


0.11 


0.10 


0.13 


0.07 


© 


0.11 


0.14 


0.10 


0.21 


20:05 


s? 


o 


o 


© 


o 


o 


© 


o 


o 


© 


© 


o 


© 


o 


o 


© 


© 


© 


o 


20:04 


ss 


0.29 


0.26 


0.25 


0.26 


0.21 


0.24 


0.27 


0.27 


0.27 


0.28 


0.28 


0.26 


0.26 


0.27 


0.96 


0.74 


1.11 


0.87 


20:03 




o 


o 


o 


0.01 | 


o 


© 


0.01 


© 


© 


© 


o 


o 




1.94 


0.69 


0.70 


0.35 


0.20 


20:02 




© 


0.01 


0.01 


o 


0.02 


0.01 


0.02 


0.01 


© 


© 


0.01 


0.02 


0.06 


© 


0.01 


0.01 


o 


o 


SPL 




f> 


■«*• 


v> 
rr> 


VO 
««■> 


r- 


oo 


On 
to 


© 




CM 


en 


T 


m 


VO 
T 


r- 
rr 


oo 


On 


o 


Treatment 




j Water 












GLA 












| DGLA 













-51- 



BNSOOCID: <WO 8848764A1> 



WO 98/46764 



PCT/US98/07421 



Leaves treated with GLA contained from 1 .56 to 2.4 wt% GLA. The fatty acid 
analysis showed that the lipid composition of control and transgenic leaves was 
essentially the same. Leaves of control plants treated with DGLA contained 
1.2-1.9 w% DGLA and background amounts of ARA (.26-.27 wt%). 
5 Transgenic leaves contained only .2-.7 wt% DGLA, but levels of ARA were 
increased (.74-1 . 1 wt%) indicating that the DGLA was converted to ARA in 
these leaves. 

Expression in Seed 

The purpose of this experiment was to determine whether a construct 
1 0 with the seed specific napin promoter would enable expression in seed. 

The Ma29 cDNA was modified by PCR to introduce ATioI cloning sites 
upstream and downstream of the start and stop codons, respectively, using the 
following primers: 

Madxho-forward: 

1 5 5'-CUACUACUACUACTCGAGCAAGATGGGAACGGACCAAGG 
(SEQ ID NO:25) 

Madxho-reverse: 

5'-CAUCAUCAUCAUCTCGAGCTACTCTTCCTTGGGACGGAG 
(SEQ ID NO:26). 

20 The PCR product was subcloned into pAMP 1 (GIBCOBRL) using the 

CloneAmp system (GIBCOBRL) to create pCGN5522 and the A5 desaturase 
sequence was verified by sequencing of both strands. 

For seed-specific expression, the Ma29 coding region was cut out of 
pCGN5522 as anXhoI fragment and inserted into the Sail site of the napin 
25 expression cassette, pCGN3223, to create pCGN5528. The //wdlll fragment of 
pCGN5528 containing the napin 5' regulatory region, the Ma29 coding region, 
and the napin 3' regulatory region was inserted into the Hindlll site of 
pCGN1557 to create pCGN5531. Two copies of the napin transcriptional unit 
were inserted in tandem. This tandem construct can permit higher expression of 
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the desaturases per genetic loci. pCGN5531 was introduced into Brassica 
napus cv.LP004 via Agrobacterium mediated transformation. 

The fatty acid composition of twenty-seed pools of mature T2 seeds was 
analyzed by GC. Table 4 shows the results obtained with independent 
5 transformed lines as compared to non-transformed LP004 seed. The transgenic 
seeds containing pCGN553 1 contain two fatty acids that are not present in the 
control seeds, tentatively identified as taxoleic acid (5,9-18:2) and pinolenic 
acid (5,9,12-18:3), based on their elution relative to oleic and linoleic acid. 
These would be the expected products of A5 desaturation of oleic and linoleic 
1 0 acids. No other differences in fatty acid composition were observed in the 
transgenic seeds. 



-53- 



WO 98/46764 



PCT/US98/07421 



1 

o 

p 

o 

o 
o 

s. 

a 

o 

U 



o 

rr ^ 


fN 
d 


fN 

d 


o 
tn 

d 


VO 
fN 

d 


m 
d 


O 


CN 

d 




fN 


o 
d 


m 
o 
d 


o 
d 


fN 

o 
d 


o 


o 


d 




o 

fN ^ 


m 
vo 
d 


d 


d 


os 
d 


o 
d 


d 


vo 
cn 

d 




© ^ 
fN 


m 
o 
d 


o 
d 


o 


»n 
o 
d 


s 

d 


o 


© 
d 




i' * 


© 


O 


OO 








vs 




§ * 

fN 


OS 

o 


OS 

d 


m 
o 


s 


OO 

Os 

d 


VO 
OS 

d 


cn 
oo 
d 






*o 

vo 


OO 

m 


o 
m 


rn 


cn 


OS 
m 


Os 
m 




(5,9,12)18:3 
% 


o 
d 


rn 
cn 

d 


r- 

fN 

d 


OO 

m 
d 


CN 

m 
d 


cn 

cn 

d 


«0 

d 




OO ^ 


OO 


CN 


m 


Os 


OO 

06 


oo 

OS 

oo 


OS 

d 

CN 




(5,9)18:2 




O 


vs 


fN 

vo* 




m 
© 


VO 

rn 
*n 






Os 
VO 


m 
m 

fN 
vo 


OO 

vo 
vo 


vq 
rn 
vo 


fN 
oo 

m 
vo 


m 


vo 
(N 
vo 




OO ^ 


cn 


m 
fN 
rn 


m 
rn 


rn 


oo 

<N 

cn 


cn 
rn 

rn 


oo 
fN 






d 


d 


d 


m 
d 


d 


d 


rn 
d 






vo 

Op 

rn 


vo 
CN 


OO 

rn 


OO 

m' 


vo 

OS 

rn 


Os 
rn 


oo 
rn 






1 

c 

8 
23 


i 

m 


fN 
i 

m 


vo 

m 
tr> 


o 
i 

m 
*r» 


VO 
i 

m 
vn 


oo 

fN 
i 

m 
in 





-54- 



BNSDOCtD:<WO 98467 64 A 1 > 



98/46764 



PCT/US98/07421 



Northern analysis is performed on plants to identify those expressing 
Ma29. Developing embryos are isolated approximately 25 days post anthesis or 
when the napin promoter is induced, and floated in a solution containing GLA 
or DGLA as described in Example 7. Fatty acid analysis of the embryos is then 
performed by GC to determine the amount of conversion of DGLA to ARA, 
following the protocol adapted for leaves in Example 7. The amount of ARA 
incorporated into triglycerides by endogenous Brassica acyltransferases is then 
evaluated by GC analysis as in Example 7. 

Example 8 

Expression of M. al pina A6 Desaturase in Brassica napus 
The Ma524 cDNA was modified by PCR to introduce cloning sites 
using the following primers: 

Ma524PCR-l (SEQ IDNO:27) 

5-CUACUACUACUATCTAGACTCGAGACCATGGCTGCTGCT 
CCAGTGTG 

Ma524PCR-2 (SEQ ID NO:28) 

S'-CAUCAUCAUCAUAGGCCTCGAGTTACTGCGCCTTACCCAT 

These primers allowed the amplification of the entire coding region and 
added Xbal and Xhol sites to the 5'-end and Xhol and Stul sites to the 3* end. 
The PCR product was subcloned into pAMPl (GIBCOBRL) using the 
CloneAmp system (GIBCOBRL) to create pCGN5535 and the A6 desaturase 
sequence was verified by sequencing of both strands. 

For seed-specific expression, the Ma524 coding region was cut out of 
pCGN5535 as an Xhol fragment and inserted into the Sail site of the napin 
expression cassette, pCGN3223, to create pCGN5536. The Notl fragment of 
pCGN5536 containing the napin 5" regulatory region, the Ma524 coding region, 
and the napin 3' regulatory region was inserted into the Notl site of pCGNl557 
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to create pCGN5538. pCGN5538 was introduced into Brassica napus 
cv.LP004 via Agrobacterium mediated transformation. 

Maturing T2 seeds were collected from 6 independent transformation 
events in the greenhouse. The fatty acid composition of single seeds was 
5 analyzed by GC. Table 5 shows the results of control LP004 seeds and six 5538 

lines. All of the 5538 lines except #8 produced seeds containing GLA. 
Presence of GLA segregated in these seeds as is expected for the T2 selfed seed 
population. In addition to GLA, the M alpina A6 desaturase is capable of 
producing 18:4 (stearidonic) and another fatty acid believed to be the 6,9-18:2. 

1 0 The above results show that desaturases with three different substrate 

specificities can be expressed in a heterologous system and used to produce 
poly-unsaturated long chain fatty acids. Exemplified were the production of 
ARA (20:4) from the precursor 20:3 (DGLA), the production of GLA (18:3) 
from 18:2 substrate, and the conversion of 18:1 substrate to 18:2, which is the 

1 5 precursor for GLA. 
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Example 9 

Expression of M. alpina A12 desaturase in Brassica napus 
The Ma648 cDNA was modified by PCR to introduce cloning sites 
using the following primers: 

5 Ma648PCR-for (SEQ ID NO:29) 

S'-CUACUACUACUAGGATCCATGGCACCTCCCAACACT 
Ma648PCR-rev (SEQ ID NO:30) 

5'-CAUCAUCAUCAUGGTACCTCGAGTTACTTCTTGAAAAAGAC 

These primers allowed the amplification of the entire coding region and 
0 added a BamHI site to the 5' end and Kpnl and Xhol sites to the 3' end. The 

PCR product was subcloned into pAMPl (GIBCOBRL) using the CloneAmp 
system (GIBCOBRL) to create pCGN5540 and the A12 desaturase sequence 
was verified by sequencing of both strands. 

For seed-specific expression, the Ma648 coding region was cut out of 
5 pCGN5540 as a BamHI/XhoI fragment and inserted between the Bglll and 

Xhol sites of the napin expression cassette, pCGN3223, to create pCGN5542. 
The Asp718 fragment of pCGN5541 containing the napin 5' regulatory region, 
the Ma648 coding region, and the napin 3' regulatory region was inserted into 
the Asp718 site of pCGN5138 to create pCGN5542. PCGN5542 was 
10 introduced into two varieties of Brassica napus via Agrobacterium mediated 

transformation. The commercial canola variety, SP30021, and a low-linolenic 
line, LP30108 were used. 

Mature selfed T2 seeds were collected from 19 independent LP30108 
transformation events and a non-transformed control grown in the greenhouse. 
15 These seeds are expected to be segregating for the A 12 desaturase transgene. 

The fatty acid composition of20-seed pools was analyzed by GC. The results 
are shown in Table 6. All transformed lines contained increased levels of 18:2, 
the product of the A12 desaturase. Levels of 18:3 were not significantly 
increased in these plants. Events # 1 1 and 16 showed the greatest accumulation 
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of 18:2 in the pooled seeds. To investigate the segregation of 18:2 levels in the 
T2 seeds and to identify individual plants to be taken on to subsequent 
generations, half-seed analysis was done. Seeds were germinated overnight in 
the dark at 30 degrees on water-soaked filter paper. The outer cotyledon was 

5 excised for GC analysis and the rest of the seedling was planted in soil. Results 

of some of these analyses are shown in Table 7. Individual T2 seeds containing 
the M alpina A 12 desaturase accumulated up to 60% 18:2 in the seeds. Sample 
97xxl 116 #59 is an example of a null segregant. Even in the highest 18:2 
accumulators, levels of 18:3 were increased only slightly. These and other 

0 individually selected T2 plants were grown in the greenhouse and in the field to 

produce T3 seed. 

Mature selfed T2 seeds were collected from 20 independent SP30021 
transformation events and a non-transformed control grown in the greenhouse. 
These seeds are expected to be segregating for the A12 desaturase transgene. 

5 The fatty acid composition of 20-seed pools was analyzed by GC. The data are 

presented in Table 8. All transformed lines contained increased levels of 18:2, 
the product of the A12 desaturase. As in the low-linolenic LP30108 line, levels 
of 18:3 were not significantly increased. Events # 4 and 12 showed the greatest 
accumulation of 1 8:2 in the pooled seeds. To investigate the segregation of 

0 1 8:2 levels in the T2 seeds and to identify individual plants to be taken on to 

subsequent generations, alf-seed analysis was done. Seeds were germinated 
overnight in the dark at 30 degrees on water-soaked filter paper. The outer 
cotyledon was excised for GC analysis and the rest of the seedling was planted 
in soil. Results of some of these analyses are shown in Table 9. Samples 

15 97xxl 157 #88 and #18 are examples of null segregants for 5542-SP30021-4 and 

5542-SP30021-12 respectively. These and other individually selected T2 plants 
were grown in the greenhouse and in the field to produce T3 seed 
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Example 10 

Simultaneous ex pression of M. alpina A6 and A12 
desaturases in Brassica nanus 

In order to express the M. alpina A6 and A12 desaturases from the same 
T-DN A, the following construct for seed-specific expression was made. 

The NotI fragment of pCGN5536 containing the containing the napin 5' 
regulatory region, the Ma524 coding region, and the napin 3' regulatory region 
was inserted into the NotI site of pCGN5542 to create pCGN5544. The 
expression modules were oriented in such a way that the direction of 
transcription from Ma524 and Ma648 and the nptll marker is the same. 

PCGN5544 was introduced into Brassica napus cv.LP30108 via 
Agrobacterium mediated transformation. Mature selfed T2 seeds were collected 
from 16 independent LP30108 transformation events and a non-transformed 
1 5 control that were grown in the greenhouse. These seeds are expected to be 

segregating for the A6+ A12 desaturase transgene. The fatty acid composition 
of 20-seed pools was analyzed by GC. The results are presented in Table 10. 
All but one of the lines (5544-LP30 108-3) shows an altered oil composition as 
compared to the controls. GLA was produced in all but three of the lines (-3, -4, 
20 -1 1); two of the three without GLA ( -4, -1 1) showed increased 18:2 indicative 

of expression of the A 12 desaturase. As a group, the levels of GLA observed in 
plants containing the double A6 + A12 construct (pCGN5544) were higher than 
those of plants containing pCGN5538 (A6 alone). In addition, levels of the A 6,9 
1 8:2 are much reduced in the plants containing the A12 + A6 as compared to A6 
25 alone. Thus, the combination of A6 and A 1 2 desaturases on one T-DN A leads 

to the accumulation of more GLA and fewer side products than expression of 
A6 desaturase alone. To investigate the segregation of GLA levels in the T2 
seeds and to identify individual plants to be taken on to subsequent generations, 
half-seed analysis was done. Seeds were germinated overnight in the dark at 30 
degrees on water-soaked filter paper. The outer cotyledon was excised for GC 
analysis and the rest of the seedling was planted in soil. Results of some of 
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these analyses are shown in Table 1 1. As expected for the T2 population, levels 
of GLA and 1 8:2 are segregating in the individual seeds. GLA content of up to 
60% of total fatty acids was observed in individual seeds. Individual events 
were selected to be grown in the greenhouse and field for production of T3 
seed. 

Transgenic plants including Brassica, soybean, safflower, corn flax and 
sunflower expressing the constructs of this invention can be a good source of 
GLA. 

Typical sources of GLA such as borage produce at most 25% GLA. In 
contrast the plants in Table 1 0 contain up to 30% GLA. Furthermore, the 
individual seeds shown in Table 1 1 contain up to 60% GLA. 
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Example 11 

Simultaneous expression of M. giving AS and A6 
desaturases in Brassica napus 

In order to produce arachadonic acid (ARA) in transgenic canola oil 

both A5 and A6 desaturase activities need to be introduced. In order to facilitate 

downstream characterization and breeding, it may be advantageous to have both 

activities encoded by a single T-DNA. The following example illustrates the 

simultaneous expression of A5 and A6 desaturases. 

The Asp718 fragment of pCGN5528 containing the napin 5 r regulatory 
region, the Ma29 coding region, and the napin 3' regulatory region was inserted 
into the Asp718 site of pCGN5138 to create pCGN5545. The NotI fragment of 
pCGN5536 containing the napin 5 f regulatory region, the Ma524 coding region, 
and the napin 3' regulatory region was inserted into the NotI site of pCGN5545 
to create pCGN5546. The expression modules were oriented in such a way that 
the direction of transcription from Ma524 and Ma29 and the nptll marker is the 
same. 

PCGN5546 was introduced into Brassica napus cv.LP30 1 08 via 
Agrobacterium mediated transformation. Mature selfed T2 seeds were collected 
from 30 independent LP30108 transformation events that were grown in the 
greenhouse. The fatty acid composition of 20-seed pools was analyzed by GC. 
The results are shown in Table 12. All the lines show expression of both 
desaturases as evidenced by the presence of A 5,9 18:2 (as seen in pCGN5531 
plants) and A 6,9 18:2 and GLA (as seen in pCGN5538 plants) 
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Example 12 

Simultaneous expression of M. giving AS. A6 and A12 
desaturases in Brassica napus 

5 In order to achieve optimal production of ARA in transgenic canola oil 

both the A6 and A12 desaturase activities may need to be present in addition to 

the A5 activity. In order to facilitate downstream characterization and breeding, 

it may be advantageous to have all of these activities encoded by a single T- 

DNA. The following example illustrates the simultaneous expression of A5, A6 

1 0 and A 1 2 desaturases. 

The HindlH fragment of pCGN5528 containing the napin 5* regulatory 
region, the Ma29 coding region, and the napin 3* regulatory region was inserted 
into the HindlH site of pCGN5544 to create pCGN5547. The expression 
modules were oriented in such a way that the direction of transcription from 
1 5 Ma29, Ma524, Ma648 and the nptll marker is the same. 

PCGN5547 was introduced into Brassica napus cv.LP30108 via 
Agrobacterium mediated transformation. Mature selfed T2 seeds were collected 
from 30 independent LP30108 transformation events that were grown in the 
greenhouse. The fatty acid composition of 20-seed pools was analyzed by GC. 

20 The results are shown in Table 1 3 . Twenty-seven of the lines show significant 

accumulation of GLA and in general the levels of GLA observed are higher 
than those seen in the 5546 plants that did not contain the A12 desaturase. The 
A 12 desaturase appears to be active in most lines as evidenced by the lack of 
detectable A6,9 18:2 and elevated 18:2 levels in most plants. Small amounts of 

25 A5,9 1 8:2 are seen in the 5547 plants, although the levels are generally less than 

those observed in the 5546 plants. This may be due to the presence of the A 12 
desaturase which efficiently converts the 18:1 to 18:2 before it can be 
desaturated at the A5 position. 
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Example 13 

Stereospecific Distribution of A6-Desaturated Oils 

This experiment was designed to investigate the stereospecific 
distribution of the A6-desaturated oils in seeds expressing pCGN5538 (Ma 524 
5 cDNA). Three seed samples were used: 

1) Non-transformed B. napus cv. LP004 seeds (control) 

2) Segregating T2 seeds of pCGN5538-LP004-19 

3) Segregating T2 seeds of pCGN5538-LP004-29 
The following protocol was used for the analysis: 

10 1 . Seed Oil Extraction 

Fifty seeds were placed in a 12 x 32 mm vial and crushed with a glass 
rod. 1 .25 mL hexane was added and the mixture was vortexed. The seeds were 
extracted overnight on a shaker. The extract was then filtered through a 0.2 
micron filter attached to a lcc syringe. The extract was then dried down under 
1 5 nitrogen. The resulting oil was used for digestion and derivatization of the 

whole oil sample. 

2. Digestion 

A. Liquid Oil Digestion 

The stock lipase (from Rhizopus arrhizus, Sigma, L4384) was diluted to 
20 approximately 600,000 units/mL with a goal of obtaining 50% digestion of the 

TAG. The stock lipase is maintained at 4 degrees C and placed on ice. The 
amount of reagents may be adjusted according to the amount of oil to be 
digested. 

The following amounts are based on a 2.0 mg extracted oil sample. In a 
25 12x32 mm screw cap vial the following were added: 2.0 mg oil, 200 jiL 0.1 M 

tris HCi pH 7, 40 \xh 2.2 w/v% CaCl 2 2H 2 Q, and 100 \iL 0.05 w/v % bile salts. 
The material was vortexed and sonicated to disperse the oil. Twenty \xL of 
diluted lipase was added and the mixture was vortexed continuously for 1 .0 
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minute at room temperature. A white precipitate formed. The reaction was 
stopped with 1 00 uL 6M HC1 and vortexing. Five hundred uL CHCl 3 :CH 3 OH 
(2:1) was added and the mixture was vortexed and held on ice while reaining 
digestions were carried out. Samples were vortexed again and centrifuged 
5 briefly to sharpen layers. The lower layer containing digest products was 

removed with a pasteur pipette and placed in a 12 x 32 mm crimp cap vial. The 
material was then re-extracted with 300 uL CHC1 3 , vortexed, centrifuged, and 
combined with the lower layers. The digest products were kept on ice as much 
as possible. HPLC separation is performed as soon as possible after digestion to 
10 minimize acyl migration. 

B. Solid Fat Digestion 

The procedure for liquid oil digestion described above was followed 
except that 20 jil 1 1 :0 methyl ester is added to 2.0 mg solid fat. 

3. HPLC Separation 

1 5 ^ digestion products were dried down in chloroform to approximately 

200 \iL. Each sample was then transferred into an insert in an 8 x 40 mm shell 
vial and 30 uL was injected for HPLC analysis. 

The high performance liquid chromatographic system was equipped 
with a Varex ELSD IIA evaporative light scattering detector with tube 
20 temperature at 1 05°C and nitrogen gas flow at 40 mL/min; a Waters 7 1 2 Wisp 

autosampler, three Beckman 1 14M Solvent Delivery Modules; a Beckman 
421 A controller, a Rheodyne pneumatically actuated stream splitter; and a 
Gilson micro fractionator. The chromatography column is a 220 x 4.6 mm, 5 
micron normal phase silica cartridge by Brownlee. 

25 The three solvents used were: 

A= hexane:toluene 1:1 

B= toluene: ethyl acetate 3 : 1 

C— 5% formic acid in ethyl acetate 
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The gradient profile was as follows: 



Time (min) 


Function 


Value 


Duration 


Oflow 


2.0 mlVmin 






0%B 


10 






0%C 


2 






2%C 


25 




6 min 


14.0 % C 


2 




1 min 


15.0 


End program 







A chromatographic standard mixture is prepared in hexane: toluene 1:1 
containing the following: 

0.2 mg/mL triglyceride 16:0 



5 2.0 mg/mL 16:0 Free Fatty Acid 

0.2 mg/mL dil6:0 mixed isomers ( 1 ,2-diacylglycerol and 1,3-diacylglycerol) 
0.2 mg/mL 3-mono acylglycerol 16:0 
0.2 mg/mL 2-mono acylglycerol 16:0 

For each sample, the fraction containing the 2-mag peak is collected 
10 automatically by method controlled timed events relays. A time delay is used to 

synchronize the detector with the collector's emitter. The 2-mag peaks are 
collected and the fractions are evaporated at room temperature overnight. 

The sn-2 composition results rely on minimization of acyl migration. 
Appearance of 1-monoacylglycerol and/or 3-monoacylglycerol peaks in the 
15 chromatograph means that acyl migration has occurred. 

4. Derivatization 

To derivatize the whole oil, 1.0 mg of the extracted whole oil was 
weighed into a 12 x 32 mm crimp cap vial. One mL toluene was then added. 
The sample is then vortexed and a 50 \xL aliquot was removed for 
20 derivatization. To the dried down 2-mag samples, 50 jiL toluene was added. To 

both the whole oil and 2-mag fractions 105 uL H2SO4/CH3OH @ 8.76 wt% is 
added. The cap was tightly capped and the sample is refluxed for 1 hour at 95 
degrees C. The sample was allowed to cool and 500 uL 10 w/v % NaCI in 
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water and 60 uL heptane was added. The organic layer was removed and 
inserted in a 12 x 32 mm crimp cap vial. 

5. GLC Analysis 

A Hewlett Packard model 6890 GC equipped with a split/splitless 
capillary inlet, FID detector, 6890 series autosampler and 3392A Alpha Omega 
integrator is set up for the capillary column as follows: 



Supelco Omegawax 250, 30 m length, 0.25 mm id, 0.25 urn film 
thickness 



injection port: 


260 C 


detector: 


270 C 


initial temp: 


170 C 


initial time: 


1 .5 min 


rate: 


30 deg/min 


final temp: 


245 C 


final time: 


6.5 min 


injection vol: 


1.5 uL 


head pressure: 


25 psi 


split ratio: 


30 


carrier gas: 


He 


make-up gas: 


N 2 


FID gas: 


H + air 



Percent compositions of fatty acid methyl esters are calculated as mole 
percents. For carbon chain lengths less than 12, the use of theoretical or 
empirical response factors in the area percent calculation is desirable. 



-88- 



BNSDOCID:<WO 9846764A1> 



WO 98/46764 



PCT/US98/07421 



6. Calculations 

The mean distribution of each acyl group at each sn~ 1 and sn-3 position 
was calculated. 

mean sn-l and sn-3 composition = (3 WO comp - MAG comp) / 2 
5 WO = whole oil 

MAG= monoacylglycerol 

The results of this analysis are presented in Table 14. The GLA and A 6,9 
18:2 are evenly distributed between the sn-2 and sn-l, 3 positions. This 
analysis can not discriminate between fatty acids in the sn-l vs. sn-3 positions. 



BNSDOCID:<WO 9846764A1> 



-89- 



WO 98/46764 



PCI7US98/07421 



Z 



1 20:1 1 




0.57 


1.17 


1.47 




1.00 


1.07 






0.40 




1.47 






20:0 




0.21 


0.91 

i 


1.26 




0.43 


1.07 


1.39 




0.14 


0.99 


1.42 






18:4 




0.00 


0.06 


0.09 




0.32 


[ 0.50 


0.59 




0.38 


0.50 


0.56 






00 




2.01 




S 




ao 


o 


0.86 




o 

<o 


o 


0.85 






18:3_A6,9,12 




0.06 


00*0 


•0.03 




12.45 


13.16 


13.52 




12.99 


13.66 


14.00 






00 




29.45 


18.51 


13.04 




14.55 


10.57 


8.58 




17.85 


12.11 


9.24 






e\ 

5 

oo 




0.00 


0.18 


0.27 




5.61 


4.53 


3.99 




6.35 


4.99 


4.31 






18:1 




64.77 


69.29 


71.55 




57.21 


57.51 


57.66 




56.35 


54.92 


54.21 




lyte 


18:0 




0.37 


3.32 


4.80 




4.12 


4.09 


4.08 




1.56 


3.73 


4.82 




each anal 


16:1 




0.15 


0.20 


0.23 




0.27 


0.33 


0.36 




0.27 


0.32 


0.35 




sition Tor 


16:0 




1.23 


4.33 


5.88 




1.65 


5.44 


7.34 




1.24 


4.96 


6.82 




om the mag and whole oil compo 






sn2 composition 


whole oil composition 


an snl, sn3 composition* 




sn2 composition 


whole oil composition 


mean snl, sn3 composition* 




sn2 composition 


whole oil composition 


mean sn 1 , sn3 composition* 












me 
















1 




Control 










5538-19 








5538-29 








-2 
"3 

u 



-90- 



BNSDOCID: <WO 9846764A1> 



WO 98/46764 PCTYUS98/07421 



Example 14 

Fatty Acid Compositions of Transgenic Plants 

A5 and A6 transgenic plants were analyzed for their fatty acid content. 

The following protocol was used for oil extraction: 

5 1 . About 400 mg of seed were weighed out in duplicate for each 

sample. 

2. The seeds were crushed in a motar and pestle. The mortar and 
pestle was rinsed twice with 3ml (2 : 1 ) (v:v) 
CHCl 3 :CH 3 OH/MeOH. An additional 6 ml (2: 1) was added to 

10 the 20ml glass vial (oil extracted in 12ml total 2:1). 

3. Samples were vortexed and placed on an orbital shaker for 2 
hours with occasional vortexing. 

4. 5ml of lMNaCl was added to each sample. Sample was 
vortexed then spun in centrifuge at 2000rpm for 5 minutes. 

1 5 Lower phase was drawn off using a pasteur pipette. 

5. Upper phase was re-extracted with an additional 5ml. Sample 
was vortexed then spun in centrifuge at 2000 rpm for 5 minutes. 
The lower phase was drawn off using a pasteur pipette and added 
to previous lower phase. 

20 6. CHChiCHaOH /MeOH was evaporated under nitrogen using 

evaporative cooling. Vial containing extracted oil was sealed 
under nitrogen. Between 120mg- 160mg oil was extracted for 
each sample. 

For GC-MS analysis, fatty acid methyl esters were dissolved in an 
25 appropriate volume of hexane and analyzed using a Hewlett-Packard 5890 

Series II Plus gas chromatograph (Hewlett Packard, Palo Alto, CA) equipped 
with a 30 m x 0.32 mm i.d. Omegawax 320 fused sillica capillary column 
(Supelco, Bellefonte, PA) and a Hewlett-Packard 5972 Series mass selective 
detector. Mass spectra were intrepreted by comparison to the mass spectra in 
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NIST/EPA/NIH Chemical Structure Database using a MS Chem Station 
(#G1036A) (Hewlett Packard). 

Transgenic line 5531-6 was analyzed in duplicate (A, B) and compared 
to control line LP004-6. The fatty acid profile results are shown in Table 15. 

5 Transgenic line 5538-19 was analyzed in duplicate (A, B) and compared 

to control line LP004-6. The fatty acid profile results are shown in Table 16. 
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Table 15 
Fattv Acid Profile 
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J^J M. ~\9r\ 
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I pi ^ftA* 
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001f0103d 


WU A 1V1V1.U 
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0.053 




U.lfO I 
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4.034 
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0.200 




C16:2 
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0.065 
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C17:0 
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0.244 
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0 151 


C16:4 










C18:0 


2.608 


2.714 


3.368 


3.417 


C18:lw9 


65.489 


66.454 


59.529 


59.073 


C18:lw7 
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C1S.2 5,9 
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6.269 
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1.428 
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Table IS 
Fattv Acid Profile 
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TOTAL 
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Table 16 
Fatty Acid Profile 





trio i n a 

5538-19 A 


5538-1 9B 


LP004-6A 


LP004-6B 




TRANSGENIC 


TRANSGENIC 


CONTROL 


CONTROL 














LRJL-2166 


LRL-2167 


LRL-2168 


LRL-2169 












A 

co:u 


0.004 


0.005 






Co:u 


0.007 


0.007 


0.004 


0.005 




0.012 


0.012 


0.008 


0.008 


C12:U 


0.020 


0.020 


0.011 


0.012 


C13:0 










f^l /I. A 


0.099 


0.108 


0.050 


0.050 


/~"| ,4.1 i-.fi 












A ACA 

0.059 


0.068 


0.017 


0.019 




5.2 11 


5.294 


4.049 


4.057 


^lu« 1 


ft i <ft 
0.350 


0.417 


0.197 


0.208 




ft 1 oo 
0. 1 V9 


A 1 O") 

0.187 


0.076 


0.077 


fi7'n 

^* 1 /• u 


ft ftOI 

o.oyz 


0.089 


0.078 


0.077 




ft 1 vtO 


0.149 


0.192 


0.198 






0.010 








J.O 1 J 


3.771 


2.585 


2.638 


t^lo. 1 


57.562 


57.051 


68.506 


68.352 




4.246 


4.022 
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Table 16 
Fatty Acid Profile 





5538-1 9 A 


5538-19B 


LP004-6A 


LP004-6B 




TRANSGENIC 


TRANSGENIC 


CONTROL 


CONTROL 














LRL-2166 


LRL-2167 


LRL-2168 


LRL-2169 












C20:4w3 










C20:5w3 










C22:0 


0.506 


0.484 


0.535 


0.539 


C22:l 


0.017 


0.020 


0.032 


0.032 


czi:5 




0.040 


0.030 


0.031 


C22:4w6 


0.038 


0.064 


0.015 


0.014 


C22:5w6 










C22:5w3 


0.023 


0.018 


0.021 


0.017 


C24:0 


0.352 


0.321 


0.353 


0.362 


C22:6w3 


0.009 








C24:lw9 
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TOTAL 


100.00 


100.00 


100.00 


100.00 
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Example 15 

Combined Expression of A6 and A12 
Desaturases in B. napus Achieved by Crossing 

Plants containing either the A6 or the A12 desaturase were crossed and 

5 individual Fl half-seeds were analyzed for fatty acid composition by GC. Data 

from one such cross are given in Table 17. The parents for the cross were 

5538-LP004-25-2-25 (A6 expressor) and 5542-SP30021-10-16 (A12 expressor). 

Reciprocal crosses were made and the results of 25 individual Fl seeds of each 

are shown in the table. Crosses are described such that the first parent indicated 

10 is the female. Both sets of crosses gave approximately the same results. 

Compared to the parents, the A 6,9 18:2 decreased, and the GLA increased. A 9 ' 12 

1 8:2 levels are increased in most of the Fl's as well. Note that these are F 1 

seeds and only contain one set of each desaturase. In future generations and 

selection of events homozygous for each desaturase, the F2 GLA levels 

1 5 obtained may be even higher. 

Combining traits by crossing may be preferable to combining traits on 
. one T-DN A in some situations. Particularly if both genes are driven off of the 
same promoter (in this case napin), issues of promoter silencing may favor this 
approach over putting nultiple cDNAs on one construct. 

20 Alternatively, in some cases, combining multiple cDNAs on one T-DNA 

may be the method of choice. The results are shown in Table 17. 
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Example 16 

Expression of M. alpina desaturases in soybean 
The M. alpina desaturases can be used to drive production of GLA and 
other PUFAs in soybean by use of the following expression constructs. Two 
5 means by which exogenous DNA can be inserted into the soybean genome are 

Agrobacterium infection or particle gun. Particle gun transformation is 
disclosed in U.S. patent 5,503,998. Plants can be selected using a glyphosate 
resistance marker (4, 971, 908). Agrobacterium transformation of soybean is 
well established to one of ordinary skill in the art. 

10 For seed specific expression, the coding regions of the desaturase 

cDNAs are placed under control of the 5 1 regulatory region of Glycine max 
alpha-type beta conglycinin storage protein gene. The specific region that can 
be used is nucleotides 78-921 of gi 169928 (Doyle, J. J., Schuler, M.A., ' 
Godette, W.D., Zenger, V., Beachy, R.N., and Slightom. J.L., 1986 J. Biol. 

1 5 Chem. 261 (20), 9228-9238). The 3' regulatory region that can be used is from 

the pea ribulose 1,5 bisphosphate carboxylase/oxygenase small subunit (rbcS) 
- gene. The specific sequences to be used are nucleotides 1-645 of gi 169145 
(Hunt, A.G. 1988 DNA 7: 329-336). 

Since soybean seeds contain more 18:2, and perhaps more endogenous 
20 A 1 2 desaturase activity than Brassica, the effect of the Mortierella A 1 2 

desaturase on achieving optimal GLA levels can be tested as follows. A 
construct containing the A6 cDNA can be used to see if A 6,9 18:2 is produced 
along with GLA. A construct containing the A 12 desaturase can be used to see 
if the amount of 18:2 can be increased in soybean. A construct containing both 
25 the A6 and A 12 desaturases can be used to produce optimal levels of GLA. 

Alternatively, plants containing each of the single desaturases may be crossed if 
necessary to combine the genes. 

Similar constructs may be made to express the A5 desaturase alone, or in 
combination with A 12 and/or A6 desaturases. 



-101- 



BNSDOCID: <WO 9848764A1> 



WO 98/46764 PCT/US98/07421 



Example 17 

Human Desaturase Gene Sequences 
Human desaturase gene sequences potentially involved in long chain 
polyunsaturated fatty acid biosynthesis were isolated based on homology 
5 between the human cDNA sequences and Mortierella alpina desaturase gene 

sequences. The three conserved "histidine boxes" known to be conserved 
among membrane-bound desaturases were found. As with some other 
membrane-bound desaturases the final HXXHH histidine box motif was found 
to be QXXHH. The amino acid sequence of the putative human desaturases 
1 0 exhibited homology to M. alpina A5, A6, A9, and A 1 2 desaturases. 

The M. alpina A5 desaturase and A6 desaturase cDNA sequences were 
used to search the LifeSeq database of Incyte Pharmaceuticals, Inc., Palo Alto, 
California 94304. The A5 desaturase sequence was divided into fragments; 1) 
amino acid no. 1-150, 2) amino acid no. 151-300, and 3) amino acid no. 301- 

15 446. The A6 desaturase sequence was divided into three fragments; 1) amino 

acid no. 1-150, 2) amino acid no. 151-300, and 3) amino acid no. 301-457. 
These polypeptide fragments were searched against the database using the 
"tblastn" algorithm. This alogarithm compares a protein query sequence against 
a nucleotide sequence database dynamically translated in all six reading frames 

20 (both strands). 

The polypeptide fragments 2 and 3 of M. alpina A5 and A6 have 
homologies with the ClonelD sequences as outlined in Table 18. The ClonelD 
represents an individual sequence from the Incyte LifeSeq database. After the 
"tblastn" results have been reviewed, Clone Information was searched with the 

25 default settings of Stringency of >=50, and Productscore <=1 00 for different 

ClonelD numbers. The Clone Information Results displayed the information 
including the ClusterlD, ClonelD, Library, HMD, Hit Description. When 
selected, the ClusterlD number displayed the clone information of all the clones 
that belong in that ClusterlD. The Assemble command assembles all of the 

30 ClonelD which comprise the ClusterlD. The following default settings were 
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used for GCG (Genetics Computer Group, University of Wisconsin 
Biotechnology Center, Madison, Wisconsin 53705) Assembly: 

Word Size: 7 

5 Minimum Overlap: 14 

Stringency: 0.8 

Minimum Identity: 1 4 

Maximum Gap: 1 0 

Gap Weight: 8 

10 Length Weight: 2 

GCG Assembly Results displayed the contigs generated on the basis of 
sequence information within the ClonelD. A contig is an alignment of DNA 
sequences based on areas of homology among these sequences. A new 

15 sequence (consensus sequence) was generated based on the aligned DNA 

sequences within a contig. The contig containing the ClonelD was identified, 
and the ambiguous sites of the consensus sequence was edited based on the 
alignment of the ClonelDs (see SEQ ID NO:31 - SEQ ID NO:35) to generate 
the best possible sequence. The procedure was repeated for all six ClonelD 

20 listed in Table 18. This produced five unique contigs. The edited consensus 

sequences of the 5 contigs were imported into the Sequencher software program 
(Gene Codes Corporation, Ann Arbor, Michigan 48 105). These consensus 
sequences were assembled. The contig 251 1785 overlaps with contig 3506132, 
and this new contig was called 2535 (SEQ ID NO:37). The contigs from the 

25 Sequencher program were copied into the Sequence Analysis software package 

of GCG. 

Each contig was translated in all six reading frames into protein 
sequences. The M alpina A5 (MA29) and A6 (MA524) sequences were 
compared with each of the translated contigs using the FastA search (a Pearson 
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and Lipman search for similarity between a query sequence and a group of 
sequences of the same type (nucleic acid or protein)). Homology among these 
sequences suggest the open reading frames of each contig. The homology 
among the M. alpina A5 and A6 to contigs 2535 and 3854933 were utilized to 
create the final contig called 253538a. Figure 9 is the FastA match of the final 
contig 253538a and MA29, and Figure 10 is the FastA match of the final contig 
253538a and MA524. The DNA sequences for the various contigs are 
presented in SEQ ID NO:31 -SEQ ID NO:37 The various peptide sequences 
are shown in SEQ ID NO:38 - SEQ ID NO: 44. 

Although the open reading frame was generated by merging the two 
contigs, the contig 2535 shows that there is a unique sequence in the beginning 
of this contig which does not match with the contig 3854933. Therefore, it is 
possible that these contigs were generated from independent desaturase like 
human genes. 

The contig 253538a contains an open reading frame encoding 432 
amino acids. It starts with Gin (CAG) and ends with the stop codon (TGA). 
The contig 253538a aligns with both M alpina A5 and A6 sequences, 
suggesting that it could be either of the desaturases, as well as other known 
desaturases which share homology with each other. The individual contigs 
listed in Table 18, as well as the intermediate contig 2535 and the final contig 
253538a can be utilized to isolate the complete genes for human desaturases. 

Uses of the Human Desaturases 

These human sequences can be expressed in yeast and plants utilizing 
the procedures described in the preceding examples. For expression in 
mammalian cells and transgenic animals, these genes may provide superior 
codon bias. In addition, these sequences can be used to isolate related 
desaturase genes from other organisms. 
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Table 18 



Sections of the 
Desaturases 


Clone ID from LifeSeq Database 


Keyword 


151-300 A5 


3808675 


fatty acid desaturase 


301-446 A5 


354535 


A6 


151-300 A6 


3448789 


A6 


151-300 A6 


1362863 


A6 


151-300 A6 


2394760 


A6 


301-457 A6 


3350263 


A6 



Example 18 

Identification of Homologues to M. giving AS and A6 desaturases 

A nucleic acid sequence that encodes a putative A5 desaturase was 
identified through a TBLASTN search of the expressed sequence tag databases 
through NCBI using amino acids 100-446 of Ma29 as a query. The truncated 
portion of the Ma29 sequence was used to avoid picking up homologies based 
on the cytochrome b5 portion at the N-terminus of the desaturase. The deduced 
amino acid sequence of an est from Dictyostelium discoideum (accession # 
- C25549) shows very significant homology to Ma29 and lesser, but still 
significant homology to Ma524. The DNA sequence is presented as SEQ ID 
NO:45. The amino acid sequence is presented as SEQ ID NO:46. 

Example 19 

Identifica tion of M alpine AS and A6 homologues in other 
PUFA-pr oducing organisms 

To look for desaturases involved in PUFA production, a cDNA library 

was constructed from total RNA isolated from Phaeodactylum tricornutum. A 

plasmid-based cDNA library was constructed in pSPORTl (GIBCO-BRL) 

following manufacturer's instructions using a commercially available kit 

(GIBCO-BRL). Random cDNA clones were sequenced and nucleic acid 

sequences that encode putative A5 or A6 desaturases were identified through 

BLAST search of the databases and comparison to Ma29 and Ma524 sequences. 
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One clone was identified from the Phaeodactylum library with 
homology to Ma29 and Ma524; it is called 144-01 1-B12. The DNA sequence is 
presented as SEQ ID NO:47. The amino acid sequence is presented as SEQ ID 
NO:48. 

5 Example 20 

Identification of M. giving A5 and A6 homologues in other 
PUFA-producing organisms 

To look for desaturases involved in PUFA production, a cDNA library 

was constructed from total RNA isolated from Schizochytrium species. A 

10 plasmid-based cDNA library was constructed in pSPORTl (GIBCO-BRL) 

following manufacturer's instructions using a commercially available kit 

(GIBCO-BRL). Random cDNA clones were sequenced and nucleic acid 

sequences that encode putative A5 or A6 desaturases were identified through 

BLAST search of the databases and comparison to Ma29 and Ma524 sequences. 

15 One clone was identified from the Schizochytrium library with 

homology to Ma29 and Ma524; it is called 81-23-C7. This clone contains a -1 
kb insert. Partial sequence was obtained from each end of the clone using the 
universal forward and reverse sequencing primers. The DNA sequence from 
the forward primer is presented as SEQ ID NO:49. The peptide sequence is 

20 presented as SEQ ID NO:50. The DNA sequence from the reverse primer is 

presented as SEQ ID NO:51. The amino acid sequence from the reverse primer 
is presented as SEQ ID NO:52. 

Example 21 

Nutritional Compositions 

25 The PUFAs of the previous examples can be utilized in various 

nutritional supplements, infant formulations, nutritional substitutes and other 
nutrition solutions. 

I. INFANT FORMULATIONS 

A. Isomil® Soy Formula with Iron. 
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Usage: As a beverage for infants, children and adults with an allergy or 
sensitivity to cow's milk. A feeding for patients with disorders for which 
lactose should be avoided: lactase deficiency, lactose intolerance and 
galactosemia. 

5 Features: 

• Soy protein isolate to avoid symptoms of cow's-milk-protein 
allergy or sensitivity 

• Lactose-free formulation to avoid lactose-associated diarrhea 

• Low osmolaity (240 mOsm/kg water) to reduce risk of osmotic 
10 diarrhea. 

• Dual carbohydrates (corn syrup and sucrose) designed to 
enhance carbohydrate absorption and reduce the risk of exceeding the 
absorptive capacity of the damaged gut. 

• 1 .8 mg of Iron (as ferrous sulfate) per 1 00 Calories to help 
1 5 prevent iron deficiency . 

• Recommended levels of vitamins and minerals. 

• Vegetable oils to provide recommended levels of essential fatty 
acids. 

• Milk-white color, milk-like consistency and pleasant aroma. 

20 Ingredients: (Pareve, ©) 85% water, 4.9% com syrup, 2.6% sugar 

(sucrose), 2.1% soy oil, 1.9% soy protein isolate, 1.4% coconut oil, 0.15% 
calcium citrate, 0.1 1 % calcium phosphate tribasic, potassium citrate, potassium 
phosphate monobasic, potassium chloride, mono- and disglycerides, soy 
lecithin, carrageenan, ascorbic acid, L-methionine, magnesium chloride, 

25 potassium phosphate dibasic, sodium chloride, choline chloride, taurine, ferrous 

sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L-carnitine, 
niacinamide, calcium pantothenate, cupric sulfate, vitamin A palmitate, 
thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 
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acid, manganese sulfate, potassium iodide, phylloquinone, biotin, sodium 
selenite, vitamin D3 and cyanocobalamin 

B. Isomil® DF Soy Formula For Diarrhea. 

Usage: As a short-term feeding for the dietary management of diarrhea 
5 in infants and toddlers. 

Features: 

• First infant formula to contain added dietary fiber from soy fiber 
specifically for diarrhea management. 

• Clinically shown to reduce the duration of loose, watery stools 
10 during mild to severe diarrhea in infants. 

• Nutritionally complete to meet the nutritional needs of the infant. 

• Soy protein isolate with added L-methionine meets or exceeds an 
infant's requirement for all essential amino acids. 

• Lactose-free formulation to avoid lactose-associated diarrhea. 

1 5 • Low osmolality (240 mOsm/kg water) to reduce the risk of 

osmotic diarrhea. 

• Dual carbohydrates (com syrup and sucrose) designed to 
enhance carbohydrate absorption and reduce the risk of exceeding the 
absorptive capacity of the damaged gut. 

20 • Meets or exceeds the vitamin and mineral levels recommended 

by the Committee on Nutrition of the American Academy of Pediatrics 
and required by the Infant Formula Act. 

• 1 .8 mg of iron (as ferrous sulfate) per 1 00 Calories to help 
prevent iron deficiency. 

25 • Vegetable oils to provide recommended levels of essential fatty 

acids. 

Ingredients: (Pareve, @) 86% water, 4.8% com syrup, 2.5% sugar 
(sucrose), 2.1% soy oil, 2.0% soy protein isolate, 1.4% coconut oil, 0.77% soy 
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fiber, 0.12% calcium citrate, 0.1 1 % calcium phosphate tribasic, 0.10% 
potassium citrate, potassium chloride, potassium phosphate monobasic, mono- 
and disglycerides, soy lecithin, carrageenan, magnesium chloride, ascorbic acid, 
L-methionine, potassium phosphate dibasic, sodium chloride, choline chloride, 
5 taurine, ferrous sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L- 

carnitine, niacinamide, calcium pantothenate, cupric sulfate, vitamin A 
palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine 
hydrochloride, folic acid, manganese sulfate, potassium iodide, phylloquinone, 
biotin, sodium selenite, vitamin D3 and cyanocobalamin 

1 0 C. Isomil® SF Sucrose-Free Soy Formula With Iron. 

Usage: As a beverage for infants, children and adults with an allergy or 
sensitivity to cow's-milk protein or an intolerance to sucrose. A feeding for 
patients with disorders for which lactose and sucrose should be avoided. 

Features: 

1 5 • Soy protein isolate to avoid symptoms of cow's-milk-protein 

allergy or sensitivity. 

• Lactose-free formulation to avoid lactose-associated diarrhea 
(carbohydrate source is Polycose® Glucose Polymers). 

• Sucrose free for the patient who cannot tolerate sucrose. 

20 • Low osmolality ( 1 80 mOsm/kg water) to reduce risk of osmotic 

diarrhea. 

• 1 .8 mg of iron (as ferrous sulfate) per 1 00 Calories to help 
prevent iron deficiency. 

• Recommended levels of vitamins and minerals. 

25 • Vegetable oils to provide recommended levels of essential fatty 

acids. 

• Milk-white color, milk-like consistency and pleasant aroma. 

Ingredients: (Pareve, ©) 75% water, 1 1.8% hydrolized cornstarch, 4.1% 
soy oil, 4.1% soy protein isolate, 2.8% coconut oil, 1.0% modified cornstarch, 
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0.38% calcium phosphate tribasic, 0.17% potassium citrate, 0.13% potassium 
chloride, mono- and disglycerides, soy lecithin, magnesium chloride, abscorbic 
acid, L-methionine, calcium carbonate, sodium chloride, choline chloride, 
carrageenan, taurine, ferrous sulfate, m-inositol, alpha-tocopheryl acetate, zinc 
5 sulfate, L-carnitine, niacinamide, calcium pantothenate, cupric sulfate, vitamin 

A palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine 
hydrochloride, folic acid, manganese sulfate, potassium iodide, phylloquinone, 
biotin, sodium selenite, vitamin D3 and cyanocobalamin 

D. Isomil® 20 Soy Formula With Iron Ready To Feed, 
10 20CaI/floz. 

Usage: When a soy feeding is desired. 

Ingredients: (Pareve, <s>) 85% water, 4.9% corn syrup, 2.6% sugar 
(sucrose), 2.1% soy oil, 1.9% soy protein isolate, 1.4% coconut oil, 0.15% 
calcium citrate, 0.1 1% calcium phosphate tribasic, potassium citrate, potassium 

15 phosphate monobasic, potassium chloride, mono- and disglycerides, soy 

lecithin, carrageenan, abscorbic acid, L-methionine, magnesium chloride, 
potassium phosphate dibasic, sodium chloride, choline chloride, taurine, ferrous 
sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L-carnitine, 
niacinamide, calcium pantothenate, cupric sulfate, vitamin A palmitate, 

20 thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 

acid, manganese sulfate, potassium iodide, phylloquinone, biotin, sodium 
selenite, vitamin D3 and cyanocobalamin. 

E. Similac® Infant Formula 

Usage: When an infant formula is needed: if the decision is made to 
25 discontinue breastfeeding before age 1 year, if a supplement to breastfeeding is 

needed or as a routine feeding if breastfeeding is not adopted. 
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Features: 

• Protein of appropriate quality and quantity for good growth; 
heat-denatured, which reduces the risk of milk-associated enteric blood 
loss. 

5 • Fat from a blend of vegetable oils (doubly homogenized), 

providing essential linoleic acid that is easily absorbed. 

• Carbohydrate as lactose in proportion similar to that of human 
milk. 

• Low renal solute load to minimize stress on developing organs. 

1 0 • Powder, Concentrated Liquid and Ready To Feed forms. 

Ingredients: (®-D) Water, nonfat milk, lactose, soy oil, coconut oil, 
mono- and diglycerides, soy lecithin, abscorbic acid, carrageenan, choline 
chloride, taurine, m-inositol, alpha-tocopheryl acetate, zinc sulfate, niacinamid, 
ferrous sulfate, calcium pantothenate, cupric sulfate, vitamin A palmitate, 
15 thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 

acid, manganese sulfate, phylloquinone, biotin, sodium selenite, vitamin D 3 and 
cyanocobalamin 

F. Similac® NeoCare Premature Infant Formula With Iron 

Usage: For premature infants 1 special nutritional needs after hospital 
20 discharge. Similac NeoCare is a nutritionally complete formula developed to 

provide premature infants with extra calories, protein, vitamins and minerals 
needed to promote catch-up growth and support development. 

Features: 

• Reduces the need for caloric and vitamin supplementation. More 
25 calories (22 Cal/fl oz) then standard term formulas (20 Cal/fl oz). 

• Highly absorbed fat blend, with medium-chain triglycerides 
(MCT oil) to help meet the special digestive needs of premature infants. 

• Higher levels of protein, vitamins and minerals per 1 00 Calories 
to extend the nutritional support initiated in-hospital. 
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• More calcium and phosphorus for improved bone mineralization. 

Ingredients: ©-D Corn syrup solids, nonfat milk, lactose, whey protein 
concentrate, soy oil, high-oleic safflower oil, fractionated coconut oil (medium- 
chain triglycerides), coconut oil, potassium citrate, calcium phosphate tribasic, 

5 calcium carbonate, ascorbic acid, magnesium chloride, potassium chloride, 

sodium chloride, taurine, ferrous sulfate, m-inositol, choline chloride, ascorbyl 
palmitate, L-carnitine, alpha-tocopheryl acetate, zinc sulfate, niacinamide, 
mixed tocopherols, sodium citrate, calcium pantothenate, cupric sulfate, 
thiamine chloride hydrochloride, vitamin A palmitate, beta carotene, riboflavin, 

10 pyridoxine hydrochloride, folic acid, manganese sulfate, phylloquinone, biotin, 

sodium selenite, vitamin D3 and cyanocobalamin. 

G. Similac Natural Care Low-Iron Human Milk Fortifier Ready 
To Use, 24 Cal/fl oz. 

Usage: Designed to be mixed with human milk or to be fed alternatively 
1 5 with human milk to low-birth-weight infants. 

Ingredients: ®-D Water, nonfat milk, hydrolyzed cornstarch, lactose, 
fractionated coconut oil (medium-chain triglycerides), whey protein 
concentrate, soil oil, coconut oil, calcium phosphate tribasic, potassium citrate, 
magnesium chloride, sodium citrate, ascorbic acid, calcium carbonate, mono- 

20 and diglycerides, soy lecithin, carrageenan, choline chloride, m-inositol, taurine, 

niacinamide, L-carnitine, alpha tocopheryl acetate, zinc sulfate, potassium 
chloride, calcium pantothenate, ferrous sulfate, cupric sulfate, riboflavin, 
vitamin A palmitate, thiamine chloride hydrochloride, pyridoxine 
hydrochloride, biotin, folic acid, manganese sulfate, phylloquinone, vitamin D 3 , 

25 sodium selenite and cyanocobalamin. 

Various PUFAs of this invention can be substituted and/or added to the 
infant formulae described above and to other infant formulae known to those in 
the art.. 
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II. NUTRITIONAL FORMULATIONS 
A. ENSURE® 

Usage: ENSURE is a low-residue liquid food designed primarily as an 
oral nutritional supplement to be used with or between meeds or, in appropriate 
5 amounts, as a meal replacement. ENSURE is lactose- and gluten-free, and is 

suitable for use in modified diets, including low-cholesterol diets. Although it 
is primarily an oral supplement, it can be fed by tube. 

Patient Conditions: 

• For patients on modified diets 

1 0 • For elderly patients at nutrition risk 

• For patients with involuntary weight loss 

• For patients recovering from illness or surgery 

• For patients who need a low-residue diet 
Ingredients: 

1 5 . ^-D Water, Sugar (Sucrose), Maltodextrin (Com), Calcium and Sodium 

Caseinates, High-Oleic Safflower Oil, Soy Protein Isolate, Soy Oil, Canola Oil, 
Potassium Citrate, Calcium Phosphate Tribasic, Sodium Citrate, Magnesium 
Chloride, Magnesium Phosphate Dibasic, Artificial Flavor, Sodium Chloride, 
Soy Lecithin, Choline Chloride, Ascorbic Acid, Carrageenan, Zinc Sulfate, 

20 Ferrous Sulfate, Alpha-Tocopheryl Acetate, Gellan Gum, Niacinamide, 

Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, Vitamin A 
Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 
Riboflavin, Folic Acid, Sodium Molybdate, Chromium Chloride, Biotin, 
Potassium Iodide, Sodium Selenate. 



25 



B. ENSURE® BARS 

Usage: ENSURE BARS are complete, balanced nutrition for 
supplemental use between or with meals. They provide a delicious, nutrient- 
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rich alternative to other snacks. ENSURE BARS contain <1 g lactose/bar, and 
Chocolate Fudge Brownie flavor is gluten-free. (Honey Graham Crunch flavor 
contains gluten.) 

Patient Conditions: 
5 • For patients who need extra calories, protein, vitamins and minerals 

• Especially useful for people who do not take in enough calories and 
nutrients 

• For people who have the ability to chew and swallow 

• Not to be used by anyone with a peanut allergy or any type of allergy to 
10 nuts. 

Ingredients: 

Honey Graham Crunch - High-Fructose Corn Syrup, Soy Protein- 
Isolate, Brown Sugar, Honey, Maltodextrin (Corn), Crisp Rice (Milled Rice, 
Sugar [Sucrose], Salt [Sodium Chloride] and Malt), Oat Bran, Partially 
1 5 Hydrogenated Cottonseed and Soy Oils, Soy Polysaccharide, Glycerine, Whey 

Protein Concentrate, Polydextrose, Fructose, Calcium Caseinate, Cocoa 
Powder, Artificial Flafors, Canola Oil, High-Oleic Safflower Oil, Nonfat Dry 
Milk, Whey Powder, Soy Lecithin and Corn Oil. Manufactured in a facility that 
processes nuts. 

20 Vitamins and Minerals: 

Calcium Phosphate Tribasic, Potassium Phosphate Dibasic, Magnesium 
Oxide, Salt (Sodium Chloride), Potassium Chloride, Ascorbic Acid, Ferric 
Orthophosphate, Alpha-Tocopheryl Acetate, Niacinamide, Zinc Oxide, Calcium 
Pantothenate, Copper Gluconate, Manganese Sulfate, Riboflavin, Beta- 
25 Carotene, Pyridoxine Hydrochloride, Thiamine Mononitrate, Folic Acid, Biotin, 

Chromium Chloride, Potassium Iodide, Sodium Selenate, Sodium Molybdate, 
Phylloquinone, Vitamin D3 and Cyanocobalamin. 
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Protein: 

Honey Graham Crunch - The protein source is a blend of soy protein isolate 
and milk proteins. 

Soy protein isolate 74% 
5 Milk proteins 26% 

Fat: 

Honey Graham Crunch - The fat source is a blend of partially 
hydrogenated cottonseed and soybean, canola, high oleic safflower, and corn 
oils, and soy lecithin. 

0 Partially hydrogenated cottonseed and soybean oil 76% 



Canola oil 8% 

High-oleic safflower oil 8% 

Com oil 4% 

Soy lecithin 4% 

15 Carbohydrate: 



Honey Graham Crunch - The carbohydrate source is a combination of 
high-fructose corn syrup, brown sugar, maltodextrin, honey, crisp rice, 
glycerine, soy polysaccharide, and oat bran. 



High-fructose com syrup 24% 

20 Brown sugar 21% 

Maltodextrin 12% 

Honey 11% 

Crisp rice 9% 

Glycerine 9% 

25 Soy polysaccharide jo/ Q 

Oat bran 7%\ 
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C. ENSURE® HIGH PROTEIN 

Usage: ENSURE HIGH PROTEIN is a concentrated, high-protein 
liquid food designed for people who require additional calories, protein, 
vitamins, and minerals in their diets. It can be used as an oral nutritional 
5 supplement with or between meals or, in appropriate amounts, as a meal 

replacement. ENSURE HIGH PROTEIN is lactose- and gluten-free, and is 
suitable for use by people recovering from general surgery or hip fractures and 
by patients at risk for pressure ulcers. 

Patient Conditions 

10 • For patients who require additional calories, protein, vitamins, and minerals, 

such as patients recovering from general surgery or hip fractures, patients at risk 
for pressure ulcers, and patients on low-cholesterol diets 

Features- 

• Low in saturated fat 

1 5 • Contains 6 g of total fat and < 5 mg of cholesterol per serving 

• Rich, creamy taste 

• Excellent source of protein, calcium, and other essential vitamins and 
minerals 

• For low-cholesterol diets 
20 • Lactose-free, easily digested 

Ingredients: 

Vanilla Supreme: -®-D Water, Sugar (Sucrose), Maltodextrin (Corn), Calcium 
and Sodium Caseinates, High-Oleic Safflower Oil, Soy Protein Isolate, Soy Oil, 
Canola Oil, Potassium Citrate, Calcium Phosphate Tribasic, Sodium Citrate, 
25 Magnesium Chloride, Magnesium Phosphate Dibasic, Artificial Flavor, Sodium 

Chloride, Soy Lecithin, Choline Chloride, Ascorbic Acid, Carrageenan, Zinc 
Sulfate, Ferrous Suffate, Alpha-Tocopheryl Acetate, Gellan Gum, Niacinamide, 
Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, Vitamin A 
Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 
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Riboflavin, Folio Acid, Sodium Motybdate, Chromium Chloride, Biotin, 
Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin D.3 and 
Cyanocobalarnin. 

Protein: 

5 The protein source is a blend of two high-biologic-value proteins: casein and 

soy. 

Sodium and calcium caseinates 85% 
Soy protein isolate 1 5% 

Fat: 

10 The fat source is a blend of three oils: high-oleic safflower, canola, and soy. 

High-oleic safflower oil 40% 

Canola oil 30% 

Soy oil 30% 

The level of fat in ENSURE HIGH PROTEIN meets American Heart 
1 5 Association (AHA) guidelines. The 6 grams of fat in ENSURE HIGH 

PROTEIN represent 24% of the total calories, with 2.6% of the fat being from 
saturated fatty acids and 7.9% from polyunsaturated fatty acids. These values 
are within the AHA guidelines of < 30% of total calories from fat, < 1 0% of the 
calories from saturated fatty acids, and < 1 0% of total calories from 
20 polyunsaturated fatty acids. 

Carbohydrate: 

ENSURE HIGH PROTEIN contains a combination of maltodextrin and 
sucrose. The mild sweetness and flavor variety (vanilla supreme, chocolate 
royal, wild berry, and banana), plus VARI-FLAVORSO® Flavor Pacs in pecan, 
25 cherry, strawberry, lemon, and orange, help to prevent flavor fatigue and aid in 

patient compliance. 

Vanilla and other nonchocolate flavors 

Sucrose 60 o/ 0 
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Maltodextrin 40% 
Chocolate 

Sucrose 70% 

Maltodextrin 20% 



r o 



f 0 



D. ENSURE ® LIGHT 

Usage: ENSURE LIGHT is a low-fat liquid food designed for use as an 
oral nutritional supplement with or between meals. ENSURE LIGHT is 
lactose- and gluten-free, and is suitable for use in modified diets, including low- 
10 cholesterol diets. 

Patient Conditions: 

• For normal-weight or overweight patients who need extra nutrition in a 
supplement that contains 50% less fat and 20% fewer calories than ENSURE 

• For healthy adults who don't eat right and need extra nutrition 
15 Features: 

• Low in fat and saturated fat 

• Contains 3 g of total fat per serving and < 5 mg cholesterol 

• Rich, creamy taste 

• Excellent source of calcium and other essential vitamins and minerals 
20 • For low-cholesterol diets 

• Lactose-free, easily digested 
Ingredients: 

French Vanilla: ©-D Water, Maltodextrin (Corn), Sugar (Sucrose), Calcium 
Caseinate, High-Oleic Safflower Oil, Canola Oil, Magnesium Chloride, Sodium 
25 Citrate, Potassium Citrate, Potassium Phosphate Dibasic, Magnesium Phosphate 

Dibasic, Natural and Artificial Flavor, Calcium Phosphate Tribasic, Cellulose 
Gel, Choline Chloride, Soy Lecithin, Carrageenan, Salt (Sodium Chloride), 
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Ascorbic Acid, Cellulose Gum, Ferrous Sulfate, Alpha-Tocopheryl Acetate, 
Zinc Sulfate, Niacinamide, Manganese Sulfate, Calcium Pantothenate, Cupric 
Sulfate, Thiamine Chloride Hydrochloride, Vitamin A Palmitate, Pyridoxine 
Hydrochloride, Riboflavin, Chromium Chloride, Folic Acid, Sodium 
5 Molybdate, Biotin, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin 

D3 and Cyanocobalamin. 

Protein: 

The protein source is calcium caseinate. 

Calcium caseinate 100% 

10 Fat 

The fat source is a blend of two oils: high-oleic safflower and canola. 
High-oleic safflower oil 70% 
Canola oil 30% 

The level of fat in ENSURE LIGHT meets American Heart Association 
15 (AHA) guidelines. The 3 grams of fat in ENSURE LIGHT represent 13.5% of 

the total calories, with 1.4% of the fat being from saturated fatty acids and 2.6% 
from polyunsaturated fatty acids. These values are within the AHA guidelines 
of < 30% of total calories from fat, < 1 0% of the calories from saturated fatty 
acids, and < 1 0% of total calories from polyunsaturated fatty acids. 

20 Carbohydrate 

ENSURE LIGHT contains a combination of maltodextrin and sucrose. 
The chocolate flavor contains com syrup as well. The mild sweetness and 
flavor variety (French vanilla, chocolate supreme, strawberry swirl), plus 
VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, lemon, and 
25 orange, help to prevent flavor fatigue and aid in patient compliance. 

Vanilla and other nonchocolate flavors 

Sucrose 51% 

Maltodextrin 49% 
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Chocolate 

Sucrose 47.0% 

Com Syrup 26.5% 

Maltodextrin 26.5% 
Vitamins and Minerals 

An 8-fl-oz serving of ENSURE LIGHT provides at least 25% of the 
RDIs for 24 key vitamins and minerals. 

Caffeine 

Chocolate flavor contains 2.1 mg caffeine/8 fl oz. 
E. ENSURE PLUS® 



Usage: ENSURE PLUS is a high-calorie, low-residue liquid food for 
use when extra calories and nutrients, but a normal concentration of protein, are 
needed. It is designed primarily as an oral nutritional supplement to be used 
1 5 with or between meals or, in appropriate amounts, as a meal replacement. 

ENSURE PLUS is lactose- and gluten-free. Although it is primarily an oral 
nutritional supplement, it can be fed by tube. 

Patient Conditions: 

• For patients who require extra calories and nutrients, but a normal 
20 concentration of protein, in a limited volume 

• For patients who need to gain or maintain healthy weight 
Features 

• Rich, creamy taste 

• Good source of essential vitamins and minerals 
25 Ingredients 

Vanilla: ®-D Water, Com Syrup, Maltodextrin (Corn), Corn Oil, Sodium and 
Calcium Caseinates, Sugar (Sucrose), Soy Protein Isolate, Magnesium Chloride, 
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Potassium Citrate, Calcium Phosphate Tribasic, Soy Lecithin, Natural and 
Artificial Flavor, Sodium Citrate, Potassium Chloride, Choline Chloride, 
Ascorbic Acid, Carrageenan, Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl 
Acetate, Niacinamide, Calcium Pantothenate, Manganese Sulfate, Cupric 
5 Sulfate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 

Riboflavin, Vitamin A Palmitate, Folic Acid, Biotin, Chromium Chloride, 
Sodium Molybdate, Potassium Iodide, Sodium Selenite, Phylloquinone, 
Cyanocobaiamin and Vitamin D3. 

Protein 

*0 The protein source is a blend of two high-biologic-value proteins: casein 

and soy. 

Sodium and calcium caseinates 84% 
Soy protein isolate 1 6% 

Fat 

1 5 The fat source is corn oil. 

Corn oil 100% 
Carbohydrate 

ENSURE PLUS contains a combination of maltodextrin and sucrose. 
The mild sweetness and flavor variety (vanilla, chocolate, strawberry, coffee, 
20 buffer pecan, and eggnog), plus VARI-FLAVORS® Flavor Pacs in pecan, 

cherry, strawberry, lemon, and orange, help to prevent flavor fatigue and aid in 
patient compliance. 

Vanilla, strawberry, butter pecan, and coffee flavors 

Com Syrup 390/, 
25 Maltodextrin 38% 

Sucrose 23% 
Chocolate and eggnog flavors 

Corn Syrup 36% 
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Maltodextrin 34% 
Sucrose 30% 
Vitamins and Minerals 

An 8-fl-oz serving of ENSURE PLUS provides at least 15% of the RDIs 
for 25 key Vitamins and minerals. 

Caffeine 

Chocolate flavor contains 3.1 mg Caffeine/8 fl 02. Coffee flavor 
contains a trace amount of caffeine. 



1 0 F. ENSURE PLUS® HN 

Usage: ENSURE PLUS HN is a nutritionally complete high-calorie, 
high-nitrogen liquid food designed for people with higher calorie and protein 
needs or limited volume tolerance. It may be used for oral supplementation or 
for total nutritional support by tube. ENSURE PLUS HN is lactose- and gluten- 
15 free. 

Patient Conditions: 

• For patients with increased calorie and protein needs, such as following 
surgery or injury 

• For patients with limited volume tolerance and early satiety 
20 Features 

• For supplemental or total nutrition 

• For oral or tube feeding 

• 1.5CaVmL 

• High nitrogen 
25 • Calorically dense 
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Ingredients 

Vanilla: ®-D Water, Maltodextrin (Com), Sodium and Calcium Caseinates, 
Corn Oil, Sugar (Sucrose), Soy Protein Isolate, Magnesium Chloride, Potassium 
Citrate, Calcium Phosphate Tribasic, Soy Lecithin, Natural and Artificial 
Flavor, Sodium Citrate, Choline Chloride, Ascorbic Acid, Taurine, L-Carnitine, 
Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Niacinamide, 
Carrageenan, Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, 
Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, Riboflavin, 
Vitamin A Palmitate, Folic Acid, Biotin, Chromium Chloride, Sodium 
Molybdate, Potassium Iodide, Sodium Selenite, Phylloquinone, 
Cyanocobalamin and Vitamin D 3 . 



G. ENSURE® POWDER 

Usage: ENSURE POWDER (reconstituted with water) is a low-residue 
15 liquid food designed primarily as an oral nutritional supplement to be used with 

or between meals. ENSURE POWDER is lactose- and gluten-free, and is 
suitable for use in modified diets, including low-cholesterol diets. 

Patient Conditions: 

• For patients on modified diets 

20 • For elderly patients at nutrition risk 

• For patients recovering from illness/surgery 

• For patients who need a low-residue diet 
Features 

• Convenient, easy to mix 
25 • Low in saturated fat 

• Contains 9 g of total fat and < 5 mg of cholesterol per serving 

• High in vitamins and minerals 

• For low-cholesterol diets 
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• Lactose-free, easily digested 

Ingredients: ©-D Corn Syrup, Maltodextrin (Corn), Sugar (Sucrose), Corn Oil, 
Sodium and Calcium Caseinates, Soy Protein Isolate, Artificial Flavor, 
Potassium Citrate, Magnesium Chloride, Sodium Citrate, Calcium Phosphate 
5 Tribasic, Potassium Chloride, Soy Lecithin, Ascorbic Acid, Choline Chloride, 

Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Niacinamide, 
Calcium Pantothenate, Manganese Sulfate, Thiamine Chloride Hydrochloride, 
Cupric Sulfate, Pyridoxine Hydrochloride, Riboflavin, Vitamin A Palmitate, 
Folic Acid, Biotin, Sodium Molybdate, Chromium Chloride, Potassium Iodide, 
10 Sodium Selenate, Phylloquinone, Vitamin D 3 and Cyanocobalamin. 

Protein 

The protein source is a blend of two high-biologic-value proteins: casein 
and soy. 

Sodium and calcium caseinates 84% 
1 5 Soy protein isolate 1 6% 

Fat 

The fat source is com oil. 

Com oil 100% 
Carbohydrate 

20 ENSURE POWDER contains a combination of corn syrup, 

maltodextrin, and sucrose. The mild sweetness of ENSURE POWDER, plus 
VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, lemon, and 
orange, helps to prevent flavor fatigue and aid in patient compliance. 



Vanilla 

25 Com Syrup 35% 

Maltodextrin 3 5% 

Sucrose 30% 
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H. ENSURE® PUDDING 

Usage: ENSURE PUDDING is a nutrient-dense supplement providing 
balanced nutrition in a nonliquid form to be used with or between meals. It is 
appropriate for consistency-modified diets (e.g., soft, pureed, or full liquid) or 
5 for people with swallowing impairments. ENSURE PUDDING is gluten-free. 

Patient Conditions: 

• For patients on consistency-modified diets (e.g., soft, pureed, or full liquid) 

• For patients with swallowing impairments 
Features 

10 • Rich and creamy, good taste 

• Good source of essential vitamins and minerals Convenient-needs no 
refrigeration 

• Gluten-free 

Nutrient Profile per 5 oz: Calories 250, Protein 10.9%, Total Fat 34.9%, 
1 5 Carbohydrate 54.2% 

Ingredients: 

Vanilla: ©-D Nonfat Milk, Water, Sugar (Sucrose), Partially Hydrogenated 
Soybean Oil, Modified Food Starch, Magnesium Sulfate. Sodium Stearoyl 
Lactylate, Sodium Phosphate Dibasic, Artificial Flavor, Ascorbic Acid, Zinc 

20 Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Choline Chloride, 

Niacinamide, Manganese Sulfate, Calcium Pantothenate, FD&C Yellow #5, 
Potassium Citrate, Cupric Sulfate, Vitamin A Palmitate, Thiamine Chloride 
Hydrochloride, Pyridoxine Hydrochloride, Riboflavin, FD&C Yellow #6, Folic 
Acid, Biotin, Phylloquinone, Vitamin D3 and Cyanocobalamin. 

25 Protein 

The protein source is nonfat milk. 

Nonfat milk 1 00% 
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Fat 

The fat source is hydrogenated soybean oil. 
Hydrogenated soybean oil 1 00% 

Carbohydrate 

ENSURE PUDDING contains a combination of sucrose and modified 
food starch. The mild sweetness and flavor variety (vanilla, chocolate, 
butterscotch, and tapioca) help prevent flavor fatigue. The product contains 9.2 
grams of lactose per serving. 

Vanilla and other nonchocolate flavors 



10 Sucrose 56% 

Lactose 27% 

Modified food starch 17% 
Chocolate 

Sucrose 5go/ 0 

15 Lactose 26% 

Modified food starch 1 6% 



I. ENSURE® WITH FIBER 

Usage: ENSURE WITH FIBER is a fiber-containing, nutritionally 
complete liquid food designed for people who can benefit from increased 
dietary fiber and nutrients. ENSURE WITH FIBER is suitable for people who 
do not require a low-residue diet. It can be fed orally or by tube, and can be 
used as a nutritional supplement to a regular diet or, in appropriate amounts, as 
a meal replacement. ENSURE WITH FIBER is lactose- and gluten-free, and is 
suitable for use in modified diets, including low-cholesterol diets. 
Patient Conditions 

• For patients who can benefit from increased dietary fiber and nutrients 
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Features 

• New advanced formula-low in saturated fat, higher in vitamins and minerals 

• Contains 6 g of total fat and < 5 mg of cholesterol per serving 

• Rich, creamy taste 

5 • Good source of fiber 

• Excellent source of essential vitamins and minerals 

• For low-cholesterol diets 

• Lactose- and gluten-free 
Ingredients 

10 Vanilla: ©-D Water, Maltodextrin (Corn), Sugar (Sucrose), Sodium and 

Calcium Caseinates, Oat Fiber, High-Oleic Safflower Oil, Canola Oil, Soy 
Protein Isolate, Com Oil, Soy Fiber, Calcium Phosphate Tribasic, Magnesium 
Chloride, Potassium Citrate, Cellulose Gel, Soy Lecithin, Potassium Phosphate 
Dibasic, Sodium Citrate, Natural and Artificial Flavors, Choline Chloride, 

15 Magnesium Phosphate, Ascorbic Acid, Cellulose Gum, Potassium Chloride, 

Carrageenan, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Zinc Sulfate, 
Niacinamide, Manganese Sulfate, Calcium Pantothenate, Cupric Sulfate, 
Vitamin A Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine 
Hydrochloride, Riboflavin, Folic Acid, Chromium Chloride, Biotin, Sodium 

20 Molybdate, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin D 3 and 

Cyanocobalamin. 

Protein 

The protein source is a blend of two high-biologic-value proteins- casein 
and soy. 

25 Sodium and calcium caseinates 80% 

Soy protein isolate 20% 
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Fat 

The fat source is a blend of three oils: high-oleic safflower, canola, and 



corn. 

High-oleic safflower oil 40% 

5 Canola oil 40% 

Corn oil 20% 



The level of fat in ENSURE WITH FIBER meets American Heart 
Association (AHA) guidelines. The 6 grams of fat in ENSURE WITH FIBER 
represent 22% of the total calories, with 2.01 % of the fat being from saturated 
fatty acids and 6.7% from polyunsaturated fatty acids. These values are within 
the AHA guidelines of < 30% of total calories from fat, < 1 0% of the calories 
from saturated fatty acids, and < 1 0% of total calories from polyunsaturated 
fatty acids. 

Carbohydrate 

ENSURE WITH FIBER contains a combination of maltodextrin and 
sucrose. The mild sweetness and flavor variety (vanilla, chocolate, and butter 
pecan), plus VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, 
lemon, and orange, help to prevent flavor fatigue and aid in patient compliance. 
Vanilla and other nonchocolate flavors 



20 Maltodextrin 66% 

Sucrose 25% 

Oat Fiber 7% 

Soy Fiber 2% 
Chocolate 

25 Maltodextrin 55% 

Sucrose 36 o /o 

Oat Fiber 7% 
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Soy Fiber 2% 

Fiber 

The fiber blend used in ENSURE WITH FIBER consists of oat fiber and 
soy polysaccharide. This blend results in approximately 4 grams of total dietary 
5 fiber per 8-fl-oz can. The ratio of insoluble to soluble fiber is 95:5. 

The various nutritional supplements described above and known to 
others of skill in the art can be substituted and/or supplemented with the PUFAs 
of this invention. 

J. Oxepa™ Nutritional Product 

1 0 Oxepa is low-carbohydrate, calorically dense enteral nutritional product 

designed for the dietary management of patients with or at risk for ARDS. It 
has a unique combination of ingredients, including a patented oil blend 
containing eicosapentaenoic acid (EPA from fish oil), y-linolenic acid (GLA 
from borage oil), and elevated antioxidant levels. 

1 5 Caloric Distribution: 

• Caloric density is high at 1 .5 Cal/mL (355 Cal/8 fl oz), to minimize the 
volume required to meet energy needs. 



The distribution of Calories in Oxepa is shown in Table 7. 



Table 7. Caloric Distribution of Oxepa 




per 8 fl oz. 


per liter 


%ofCal 


Calories 


355 


1,500 




Fat(g) 


22.2 


93.7 


55.2 


Carbohydrate (g) 


25 


105.5 


28.1 


Protein (g) 


14.8 


62.5 


16.7 


Water (g) 


186 


785 





20 Fat: 

• Oxepa contains 22.2 g of fat per 8-fl oz serving (93 .7 g/L). 

• The fat source is a oil blend of 3 1 .8% canola oil, 25% medium-chain 
triglycerides (MCTs), 20% borage oil, 20% fish oil, and 3.2 % soy lecithin. The 
typical fatty acid profile of Oxepa is shown in Table 8. 
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• Oxepa provides a balanced amount of polyunsaturated, monounsaturated, 
and saturated fatty acids, as shown in Table 10. 

• Medium-chain trigylcerides (MCTs) 25% of the fat blend aid gastric 
emptying because they are absorbed by the intestinal tract without 

5 emulsification by bile acids. 

The various fatty acid components of Oxepa™ nutritional product can 



10 



be substituted and/or supplemented with the PUFAs of this invention. 


Table 8, Typical Fatty Acid Profile 




% Total Fatty 
Acids 


g/8 fl oz* 


g/L* 


Caproic (6:0) 


0.2 


0.04 


0.18 


Caprylic (8:0) 


14.69 


3.1 


13.07 


Capric(10:0) 


11.06 


2.33 


9.87 


Palmitic (16:0) 


5.59 


1.18 


4.98 


Palmitoleic(16:ln-7) 


1.82 


0.38 


1.62 


Stearic (18:0) 


1.84 


0.39 


1.64 


01eic(18:ln-9) 


24.44 


5.16 


21.75 


Linoleic (18:2n-6) 


16.28 


3.44 


14.49 


a-Linolenic (18:3n-3) 


3.47 


0.73 


3.09 


Y-Linolenic (18:3n-6) 


4.82 


1.02 


4.29 


Eicosapentaenoic (20:5n- 
3) 


5.11 


1.08 


4.55 


n-3-Docosapentaenoic 
(22:5n-3) 


0.55 


0.12 


0.49 


Docosahexaenoic (22:6n- 
3) 


2.27 


0.48 


2.02 


Others 


7.55 


1.52 


6.72 


w Fatty acids equal approximately ^5% of total fat. 


Table 9. Fat Profile of Oxeoa. 


% ot total calones from fat 


55.2 


Polyunsaturated fatty acids 


31.44 g/L 


Monounsaturated fatty acids 


25.53 g/L 


Saturated fatty acids 


32.38 g/L 


n-6 to n-3 ratio 


1.75:1 


Cholesterol 


9.49 mg/8 fl oz 
40.1 mg/L 
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Carbohydrate: 

• The carbohydrate content is 25.0 g per 8-fl-oz serving (105.5 g/L). 

• The carbohydrate sources are 45% maltodextrin (a complex carbohydrate) 
and 55% sucrose (a simple sugar), both of which are readily digested and 

5 absorbed. 

• The high-fat and low-carbohydrate content of Oxepa is designed to 
minimize carbon dioxide (C0 2 ) production. High C0 2 levels can complicate 
weaning in ventilator-dependent patients. The low level of carbohydrate also 
may be useful for those patients who have developed stress-induced 

10 hyperglycemia. 

• Oxepa is lactose-free. 

Dietary carbohydrate, the amino acids from protein, and the glycerol 
moiety of fats can be converted to glucose within the body. Throughout this 
process, the carbohydrate requirements of glucose-dependent tissues (such as 

1 5 the central nervous system and red blood cells) are met. However, a diet free of 

carbohydrates can lead to ketosis, excessive catabolism of tissue protein, and 
loss of fluid and electrolytes. These effects can be prevented by daily ingestion 
of 50 to 100 g of digestible carbohydrate, if caloric intake is adequate. The 
carbohydrate level in Oxepa is also sufficient to minimize gluconeogenesis, if 

20 energy needs are being met. 

Protein: 

• Oxepa contains 14.8 g of protein per 8-fl-oz serving (62.5 g/L). 

• The total calorie/nitrogen ratio (150:1) meets the need of stressed patients. 

• Oxepa provides enough protein to promote anabolism and the maintenance 
25 of lean body mass without precipitating respiratory problems. High protein 

intakes are a concern in patients with respiratory insufficiency. Although 
protein has little effect on C0 2 production, a high protein diet will increase 
ventilatory drive. 
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• The protein sources of Oxepa are 86.8% sodium caseinate and 13.2% 
calcium caseinate. 

• As demonstrated in Table 1 1, the amino acid profile of the protein system in 
Oxepa meets or surpasses the standard for high quality protein set by 

5 theNational Academy of Sciences. 

• Oxepa is gluten-free. 

All publications and patent applications mentioned in this specification 
are indicative of the level of skill of those skilled in the art to which this 
0 invention pertains. All publications and patent applications are herein 

incorporated by reference to the same extent as if each individual publication or 
patent application was specifically and individually indicated to be incorporated 
by reference. 

The invention now being fully described, it will be apparent to one of 
5 ordinary skill in the art that many changes and modifications can be made 

thereto without departing from the spirit or scope of the appended claims. 
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SEQUENCE LISTING 



10 



30 



40 



(1) GENERAL INFORMATION: 



(i) APPLICANT: KNUTZON, DEBORAH 
MURKER JI, PRADIP 
HUANG, YUNG-SHENG 
THURMOND, JENNIFER 
CHAUDHARY, SUNITA 
LEONARD, AMANDA 

*5 (ii) TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR SYNTHESIS 

OF LONG CHAIN POLY -UN SATURATED FATTY ACIDS IN PLANTS 

(iii) NUMBER OF SEQUENCES: 52 

20 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LIMBACH & LIMBACH L.L.P. 
<B) STREET: 2001 FERRY BUILDING 

(C) CITY: SAN FRANCISCO 

(D) STATE: CA 

25 (E) COUNTRY: USA 

(F) ZIP: 94111 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE r Microsoft Word 



(vi) CURRENT APPLICATION DATA: 
35 (A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/834,033 

(B) FILING DATE: ll-APR-1997 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/833,610 
45 (B) FILING DATE: ll-APR-1997 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: MICHAEL R. WARD 

(B) REGISTRATION NUMBER: 38,351 

50 (C) REFERENCE/ DOCKET NUMBER: CGAB-320 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 433-4150 

(B) TELEFAX: (415) 433-8716 
55 (C) TELEX: N/A 



(2) INFORMATION FOR SEQ ID NO:l: 

60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1617 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 





CGACACTCCT 


TCCTTCTTCT 


CACCCGTCCT 


AGTCCCCTTC 


AACCCCCCTC 


TTTGACAAAG 


60 


15 


ACAACAAACC 


ATGGCTGCTG 


CTCCCAGTGT 


GAGGACGTTT 


ACTCGGGCCG 


AGGTTTTGAA 


120 


TGCCGAGGCT 


CTGAATGAGG 


GCAAGAAGGA 


TGCCGAGGCA 


CCCTTCTTGA 


TGATCATCGA 


180 




CAACAAGGTG 


TACGATGTCC 


GCGAGTTCGT 


CCCTGATCAT 


CCCGGTGGAA 


GTGTGATTCT 


240 


20 


CACGCACGTT 


GGCAAGGACG 


GCACTGACGT 


CTTTGACACT 


TTTCACCCCG 


AGGCTGCTTG 


300 




GGAGACTCTT 


GCCAACTTTT 


ACGTTGGTGA 


TATTGACGAG 


AGCGACCGCG 


ATATCAAGAA 


360 


25 


TGATGACTTT 


GCGGCCGAGG 


TCCGCAAGCT 


GCGTACCTTG 


TTCCAGTCTC 


TTGGTTACTA 


420 


CGATTCTTCC 


AAGGCATACT 


ACGCCTTCAA 


GGTCTCGTTC 


AACCTCTGCA 


TCTGGGGTTT 


480 




GTCGACGGTC 


ATTGTGGCCA 


AGTGGGGCCA 


GACCTCGACC 


CTCGCCAACG 


TGCTCTCGGC 


540 


30 


TGCGCTTTTG 


GGTCTGTTCT 


GGCAGCAGTG 


CGGATGGTTG 


GCTCACGACT 


TTTTGCATCA 


600 




CCAGGTCTTC 


CAGGACCGTT 


TCTGGGGTGA 


TCTTTTCGGC 


GCCTTCTTGG 


GAGGTGTCTG 


660 


35 


CCAGGGCTTC 


TCGTCCTCGT 


GGTGGAAGGA 


CAAGCACAAC 


ACTCACCACG 


CCGCCCCCAA 


720 


CGTCCACGGC 


GAGGATCCCG 


ACATTGACAC 


CCACCCTCTG 


TTGACCTGGA 


GTGAGCATGC 


780 




GTTGGAGATG 


TTCTCGGATG 


TCCCAGATGA 


GGAGCTGACC 


CGCATGTGGT 


CGCGTTTCAT 


840 


40 


GGTCCTGAAC 


CAGACCTGGT 


TTTACTTCCC 


CATTCTCTCG 


TTTGCCCGTC 


TCTCCTGGTG 


900 




CCTCCAGTCC 


ATTCTCTTTG 


TGCTGCCTAA 


CGGTCAG^rr 






Q C A 

y bo 


45 


TGTGCCCATC 


TCGTTGGTCG 


AGCAGCTGTC 


GCTTGCGATG 


CACTGGACCT 


GGTACCTCGC 


1020 




CACCATGTTC 


CTGTTCATCA 


AGGATCCCGT 


CAACATGCTG 


GTGTACTTTT 


TGGTGTCGCA 


1080 




GGCGGTGTGC 


GGAAACTTGT 


TGGCGATCGT 


GTTCTCGCTC 


AACCACAACG 


GTATGCCTGT 


1140 


50 


GATCTCGAAG 


GAGGAGGCGG 


TCGATATGGA 


TTTCTTCACG 


AAGCAGATCA 


TCACGGGTCG 


1200 




TGATGTCCAC 


CCGGGTCTAT 


TTGCCAACTG 


GTTCACGGGT 


GGATTGAACT 


ATCAGATCGA 


1260 


55 


GCACCACTTG 


TTCCCTTCGA 


TGCCTCGCCA 


CAACTTTTCA 


AAGATCCAGC 


CTGCTGTCGA 


1320 




GACCCTGTGC 


AAAAAGTACA 


ATGTCCGATA 


CCACACCACC 


GGTATGATCG 


AGGGAACTGC 


1380 




AGAGGTCTTT 


AGCCGTCTGA 


ACGAGGTCTC 


CAAGGCTGCC 


TCCAAGATGG 


GTAAGGCGCA 


1440 


60 


GTAAAAAAAA 


AAACAAGGAC 


GTTTTTTTTC 


GCCAGTGCCT 


GTGCCTGTGC 


CTGCTTCCCT 


1500 




TGTCAAGTCG 


AGCGTTTCTG 


GAAAGGATCG 


TTCAGTGCAG 


TATCATCATT 


CTCCTTTTAC 


1560 
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CCCCCGCTCA TATCTCATTC ATTTCTCTTA TTAAACAACT TGTTCCCCCC TTCACCG 1617 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 457 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



15 



20 



30 



35 



45 



50 



60 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2; 



Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu 
15 10 15 

Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe 
23 20 25 30 

Leu Met He He Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro 
35 40 45 

Asp His Pro Gly Gly Ser Val He Leu Thr His Val Gly Lys Asp Gly 
50 55 60 

Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu 
65 7 ° 75 80 

Ala Asn Phe Tyr Val Gly Asp He Asp Glu Ser Asp Arg Asp He Lys 
85 90 95 

ACi Asn As P As P phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gin 

40 "o 105 no 

Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val 
115 120 125 

Ser Phe Asn Leu Cys He Trp Gly Leu Ser Thr Val He Val Ala Lys 
130 135 140 

Trp Gly Gin Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu 
145 ISO 160 

Gly Leu Phe Trp Gin Gin Cys Gly Trp Leu Ala His Asp Phe Leu His 
!65 170 175 

cc His Gln Val phe Gln As P Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe 

33 180 185 190 

Leu Gly Gly Val Cys Gln Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys 
195 200 205 

His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp 
210 215 220 
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He Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met 
225 230 235 240 

Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe 
245 250 255 

Met Val Leu Asn Gin Thr Trp Phe Tyr Phe Pro He Leu Ser Phe Ala 
260 265 270 

Arg Leu Ser Trp Cys Leu Gin Ser He Leu Phe Val Leu Pro Asn Gly 
275 280 285 

Gin Ala His Lys Pro Ser Gly Ala Arg Val Pro He Ser Leu Val Glu 
290 295 300 

Gin Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe 
305 310 315 320 

Leu Phe He Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser 
325 330 335 

Gin Ala Val Cys Gly Asn Leu Leu Ala He Val Phe Ser Leu Asn His 
340 345 350 

Asn Gly Met Pro Val He Ser Lys Glu Glu Ala Val Asp Met Asp Phe 
355 360 365 

Phe Thr Lys Gin He He Thr Gly Arg Asp Val His Pro Gly Leu Phe 
3*70 375 380 

Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gin He Glu His His Leu 
385 390 395 400 

Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys He Gin Pro Ala Val 
405 4io 415 

Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met 
4 20 425 430 

He Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys 
435 440 445 

Ala Ala Ser Lys Met Gly Lys Ala Gin 
450 455 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



^ (ii) MOLECULE TYPE: DNA (genomic) 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GTCCCCTGTC GCTGTCGGCA CACCCCATCC TCCCTCGCTC CCTCTGCGTT TGTCCTTGGC 60 
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CCACCGTCTC 


TCCTCCACCC 


TCCGAGACGA 


CTGCAACTGT 


AATCAGGAAC 


CGACAAATAC 


120 




ACGATTTCTT 


TTTACTCAGC 


ACCAACTCAA 


AATCCTCAAC 


CGCAACCCTT 


TTTCAGGATG 


180 


5 


GCACCTCCCA 


ACACTATCGA 


TGCCGGTTTG 


ACCCAGCGTC 


ATATCAGCAC 


CTCGGCCCCA 


240 




AACTCGGCCA 


AGCCTGCCTT 


CGAGCGCAAC 


TACCAGCTCC 


CCGAGTTCAC 


CATCAAGGAG 


300 


10 


ATCCGAGAGT 


GCATCCCTGC 


CCACTGCTTT 


GAGCGCTCCG 


GTCTCCGTGG 


TCTCTGCCAC 


360 


GTTGCCATCG 


ATCTGACTTG 


GGCGTCGCTC 


TTGTTCCTGG 


CTGCGACCCA 


GATCGACAAG 


420 




TTTGAGAATC 


CCTTGATCCG 


CTATTTGGCC 


TGGCCTGTTT 


ACTGGATCAT 


GCAGGGTATT 


480 


15 


GTCTGCACCG 


GTGTCTGGGT 


GCTGGCTCAC 


GAGTGTGGTC 


ATCAGTCCTT 


CTCGACCTCC 


540 




AAGACCCTCA 


ACAACACAGT 


TGGTTGGATC 


TTGCACTCGA 


TGCTCTTGGT 


CCCCTACCAC 


600 


20 


TCCTGGAGAA 


TCTCGCACTC 


GAAGCACCAC 


AAGGCCACTG 


GCCATATGAC 


CAAGGACCAG 


660 




GTCTTTGTGC 


CCAAGACCCG 


CTCCCAGGTT 


GGCTTGCCTC 


CCAAGGAGAA 


CGCTGCTGCT 


720 




GCCGTTCAGG 


AGGAGGACAT 


GTCCGTGCAC 


CTGGATGAGG 


AGGCTCCCAT 


TGTGACTTTG 


780 


25 


TTCTGGATGG 


TGATCCAGTT 


CTTGTTCGGA 


TGGCCCGCGT 


ACCTGATTAT 


GAACGCCTCT 


840 




GGCCAAGACT 


ACGGCCGCTG 


GACCTCGCAC 


TTCCACACGT 


ACTCGCCCAT 


CTTTGAGCCC 


900 


30 


CGCAACTTTT 


TCGACATTAT 


TATCTCGGAC 


CTCGGTGTGT 


TGGCTGCCCT 


CGGTGCCCTG 


960 




ATCTATGCCT 


CCATGCAGTT 


GTCGCTCTTG 


ACCGTCACCA 


AGTACTATAT 


TGTCCCCTAC 


1020 




CTCTTTGTCA 


ACTTTTGGTT 


GGTCCTGATC 


ACCTTCTTGC 


AGCACACCGA 


TCCCAAGCTG 


1080 


35 


CCCCATTACC 


GCGAGGGTGC 


CTGGAATTTC 


CAGCGTGGAG 


CTCTTTGCAC 


CGTTGACCGC 


1140 




TCGTTTGGCA 


AGTTCTTGGA 


CCATATGTTC 


CACGGCATTG 


TCCACACCCA 


TGTGGCCCAT 


1200 


40 


CACTTGTTCT 


CGCAAATGCC 


GTTCTACCAT 


GCTGAGGAAG 


CTACCTATCA 


TCTCAAGAAA 


1260 




CTGCTGGGAG 


AGTACTATGT 


GTACGACCCA 


TCCCCGATCG 


TCGTTGCGGT 


CTGGAGGTCG 


1320 




TTCCGTGAGT 


GCCGATTCGT 


GGAGGATCAG 


GGAGACGTGG 


TCTTTTTCAA 


GAAGTAAAAA 


1380 


45 


AAAAGACAAT 


GGACCACACA 


CAACCTTGTC 


TCTACAGACC 


TACGTATCAT 


GTAGCCATAC 


1440 




CACTTCATAA 


AAGAACATGA 


GCTCTAGAGG 


CGTGTCATTC 


GCGCCTCC 




1488 



^ (2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
^ <D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
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Met Ala Pro Pro Asn Thr lie Asp Ala Gly Leu Thr Gin Arg His He 
15 10 15 

Ser Thr Ser Ala Pro Asn Ser Ala Lys Pro Ala Phe Glu Arg Asn Tyr 
20 25 30 

Gin Leu Pro Glu Phe Thr He Lys Glu He Arg Glu Cys He Pro Ala 
35 40 45 

His Cys Phe Glu Arg Ser Gly Leu Arg Gly Leu Cys His Val Ala He 
50 55 60 



As P Leu Thr Tr P Ala Ser Leu Leu Phe Leu Ala Ala Thr Gin He Asp 
13 65 70 75 80 



Lys Phe Glu Asn Pro Leu He Arg Tyr Leu Ala Trp Pro Val Tyr Trp 
85 90 95 

He Met Gin Gly lie Val Cys Thr Gly Val Trp Val Leu Ala His Glu 
100 105 110 

Cys Gly His Gin Ser Phe Ser Thr Ser Lys Thr Leu Asn Asn Thr Val 
115 120 125 

Gly Trp He Leu His Ser Met Leu Leu Val Pro Tyr His Ser Trp Ara 
130 135 140 



~ A Ile Ser His s er Lys His His Lys Ala Thr Gly His Met Thr Lys Asp 

30 145 150 155 160 



Gin Val Phe Val Pro Lys Thr Arg Ser Gin Val Gly Leu Pro Pro Lys 
165 170 175 

Glu Asn Ala Ala Ala Ala Val Gin Glu Glu Asp Met Ser Val His Leu 
180 185 190 

Asp Glu Glu Ala Pro Ile Val Thr Leu Phe Trp Met Val Ile Gin Phe 
195 200 205 

Leu Phe Gly Trp Pro Ala Tyr Leu Ile Met Asn Ala Ser Gly Gin Asp 
210 215 220 



A< T V r G1 V Ar <? Trp Thr Ser His Phe His Thr Tyr Ser Pro Ile Phe Glu 

45 225 230 235 240 



Pro Arg Asn Phe Phe Asp Ile Ile He Ser Asp Leu Gly Val Leu Ala 
245 250 255 

Ala Leu Gly Ala Leu Ile Tyr Ala Ser Met Gin Leu Ser Leu Leu Thr 
260 265 270 

Val Thr Lys Tyr Tyr Ile Val Pro Tyr Leu Phe Val Asn Phe Trp Leu 
275 280 285 

Val Leu Ile Thr Phe Leu Gin His Thr Asp Pro Lys Leu Pro His Tyr 
290 295 300 

Arg Glu Gly Ala Trp Asn Phe Gin Arg Gly Ala Leu Cys Thr Val Asp 
305 310 315 320 
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Arg Ser Phe Gly Lys Phe Leu Asp His Met Phe His Gly He Val His 
325 330 335 

Thr His Val Ala His His Leu Phe Ser Gin Met Pro Phe Tyr His Ala 
340 345 350 

Glu Glu Ala Thr Tyr His Leu Lys Lys Leu Leu Gly Glu Tyr Tyr Val 
355 360 365 

Tyr Asp Pro Ser Pro He Val Val Ala Val Trp Arg Ser Phe Arg Glu 
370 375 380 

Cys Arg Phe Val Glu Asp Gin Gly Asp Val Val Phe Phe Lys Lys 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SI 
GCTTCCTCCA GTTCATCCTC CATTTCGCCA 
GGGAACGGAC CAAGGAAAAA CCTTCACCTG 
CGACCTACTC TTGGCCATCC GCGGCAGGGT 
TCCTGGTGGA GTGGACACTC TCCTGCTCGG 
GATGTATCAC GCGTTTGGGG CTGCAGATGC 
GGTCTCGAAT GAGCTGCCCA TCTTCCCGGA 
GAGAGTCGAG GGCTACTTTA CGGATCGGAA 
GGGACGATAC GCTCTTATCT TTGGATCCTT 
GCCTTTCGTT GTCGAACGCA CATGGCTTCA 
GTGCGCACAA GTCGGACTCA ACCCTCTTCA 
CCCCACTGTC TGGAAGATTC TGGGAGCCAC 
GGTGTGGATG TACCAACATA TGCTCGGCCA 
TCCCGACGTG TCGACGTCTG AGCCCGATGT 
TGTCAACCAC ATCAACCAGC ACATGTTTGT 
GGTGCGCATT CAGGACATCA ACATTTTGTA 
CAATCCCATC TCGACATGGC ACACTGTGAT 



ID NO:5: 



CCTGCATTCT 


TTACGACCGT 


TAAGCAAGAT 


60 


GGAAGAGCTG 


GCGGCCCATA 


ACACCAAGGA 


120 


GTACGATGTC 


ACAAAGTTCT 


TGAGCCGCCA 


180 


AGCTGGCCGA 


GATGTTACTC 


CGGTCTTTGA 


240 


CATTATGAAG 


AAGTACTATG 


TCGGTACACT 


300 


GCCAACGGTG 


TTCCACAAAA 


CCATCAAGAC 


360 


CATTGATCCC 


AAGAATAGAC 


CAGAGATCTG 


420 


GATCGCTTCC 


TACTACGCGC 


AGCTCTTTGT 


480 


GGTGGTGTTT 


GCAATCATCA 


TGGGATTTGC 


540 


TGATGCGTCT 


CACTTTTCAG 


TGACCCACAA 


600 


GCACGACTTT 


TTCAACGGAG 


CATCGTACCT 


660 


TCACCCCTAC 


ACCAACATTG 


CTGGAGCAGA 


720 


TCGTCGTATC 


AAGCCCAACC 


AAAAGTGGTT 


780 


TCCTTTCCTG 


TACGGACTGC 


TGGCGTTCAA 


840 


CTTTGTCAAG 


ACCAATGACG 


CTATTCGTGT 


900 


GTTCTGGGGC 


GGCAAGGCTT 


TCTTTGTCTG 


960 
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GTATCGCCTG ATTGTTCCCC TGCAGTATCT GCCCCTGGGC AAGGTGCTGC TCTTGTTCAC 1020 
GGTCGCGGAC ATGGTGTCGT CTTACTGGCT GGCGCTGACC TTCCAGGCGA ACCACGTTGT 1080 
TGAGGAAGTT CAGTGGCCGT TGCCTGACGA GAACGGGATC ATCCAAAAGG ACTGGGCAGC 1140 
TATGCAGGTC GAGACTACGC AGGATTACGC ACACGATTCG CACCTCTGGA CCAGCATCAC 1200 
TGGCAGCTTG AACTACCAGG CTGTGCACCA TCTGTTCCCC AACGTGTCGC AGCACCATTA 12 60 
TCCCGATATT CTGGCCATCA TCAAGAACAC CTGCAGCGAG TACAAGGTTC. CATACCTTGT 1320 
CAAGGATACG TTTTGGCAAG CATTTGCTTC ACATTTGGAG CACTTGCGTG TTCTTGGACT 1380 
15 CCGTCCCAAG GAAGAGTAGA AGAAAAAAAG CGCCGAATGA AGTATTGCCC CCTTTTTCTC 14 40 

CAAGAATGGC AAAAGGAGAT CAAGTGGACA TTCTCTATGA AGA 14 83 

(2) INFORMATION FOR SEQ ID NO: 6: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
25 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Gly Thr Asp Gin Gly Lys Thr Phe Thr Trp Glu Glu Leu Ala Ala 
15 io is 

His Asn Thr Lys Asp Asp Leu Leu Leu Ala lie Arg Gly Arg Val Tyr 
20 25 30 

Asp Val Thr Lys Phe Leu Ser Arg His Pro Gly Gly Val Asp Thr Leu 
35 40 45 

Leu Leu Gly Ala Gly Arg Asp Val Thr Pro Val Phe Glu Met Tyr His 
50 55 60 

Ala Phe Gly Ala Ala Asp Ala lie Met Lys Lys Tyr Tyr Val Gly Thr 
65 7 0 75 80 

Leu Val Ser Asn Glu Leu Pro He Phe Pro Glu Pro Thr Val Phe His 
85 90 95 

Lys Thr He Lys Thr Arg Val Glu Gly Tyr Phe Thr Asp Arg Asn He 
100 105 no 

Asp Pro Lys Asn Arg Pro Glu He Trp Gly Arg Tyr Ala Leu He Phe 
115 120 125 

Gly Ser Leu He Ala Ser Tyr Tyr Ala Gin Leu Phe Val Pro Phe Val 
130 135 140 

Val Glu Arg Thr Trp Leu Gin Val Val Phe Ala He He Met Gly Phe 
145 150 155 160 
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Ala Cys Ala Gin Val Gly Leu Asn Pro Leu His Asp Ala Ser His Phe 
165 170 175 

Ser Val Thr His Asn Pro Thr Val Trp Lys lie Leu Gly Ala Thr His 
180 185 190 

Asp Phe Phe Asn Gly Ala Ser Tyr Leu Val Trp Met Tyr Gin His Met 
195 200 205 

Leu Gly His His Pro Tyr Thr Asn He Ala Gly Ala Asp Pro Asp Val 
210 215 220 



Ser Thr Ser Glu Pro Asp Val Arg Arg He Lys Pro Asn Gin Lys Trp 
15 225 230 235 240 



Phe Val Asn His He Asn Gin His Met Phe Val Pro Phe Leu Tyr Gly 
245 250 255 

Leu Leu Ala Phe Lys Val Arg He Gin Asp He Asn He Leu Tyr Phe 
260 265 270 

Val Lys Thr Asn Asp Ala He Arg Val Asn Pro He Ser Thr Trp His 
275 280 285 

Thr Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu 
290 295 300 



He Val Pro Leu Gin Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe 
30 305 310 315 320 



Thr Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gin 
325 330 335 

Ala Asn His Val Val Glu Glu Val Gin Trp Pro Leu Pro Asp Glu Asn 
340 345 350 

Gly He He Gin Lys Asp Trp Ala Ala Met Gin Val Glu Thr Thr Gin 
355 360 365 

Asp Tyr Ala His Asp Ser His Leu Trp Thr Ser He Thr Gly Ser Leu 
370 375 380 



Asn Tyr Gin Ala Val His His Leu Phe Pro Asn Val Ser Gin His His 
45 385 390 395 400 



Tyr Pro Asp He Leu Ala He He Lys Asn Thr Cys Ser Glu Tyr Lys 
405 410 415 

Val Pro Tyr Leu Val Lys Asp Thr Phe Trp Gin Ala Phe Ala Ser His 
420 425 430 

Leu Glu His Leu Arg Val Leu Gly Leu Arg Pro Lys Glu Glu 
435 440 445 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 amino acids 
60 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Glu Val Arg Lys Leu Arg Thr Leu Phe Gin Ser Leu Gly Tyr Tyr Asp 
10 1 5 10 15 



Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val Ser Phe Asn Leu Cys He 
20 25 30 

Trp Gly Leu Ser Thr Val He Val Ala Lys Trp Gly Gin Thr Ser Thr 
35 40 45 

Leu Ala Asn Val Leu Ser Ala Ala Leu Leu Gly Leu Phe Trp Gin Gin 
50 55 60 

Cys Gly Trp Leu Ala His Asp Phe Leu His His Gin Val Phe Gin Asp 
65 70 75 80 

Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe Leu Gly Gly Val Cys Gin 
85 90 95 

Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys His Asn Thr His His Ala 
100 105 " HO 

Ala Pro Asn Val His Gly Glu Asp Pro Asp He Asp Thr His Pro Leu 
US 120 125 

Leu Thr Trp Ser Glu His Ala Leu Glu Met Phe Ser Asp Val Pro Asp 
130 135 140 

Glu Glu Leu Thr Arg Met Trp Ser Arg Phe Met Val Leu Asn Gin Thr 
145 150 155 160 

Trp Phe Tyr Phe Pro He Leu Ser Phe Ala Arg Leu Ser Trp Cys Leu 
165 170 175 

Gin Ser He Leu Phe Val Leu Pro Asn Gly Gin Ala His Lys Pro Ser 
180 185 190 

Gly Ala Arg Val Pro He Ser Leu Val Glu Gin Leu Ser Leu Ala Met 
195 200 205 

His Trp Thr Trp Tyr Leu Ala Thr Met Phe Leu Phe He Lys Asp Pro 
210 215 220 

Val Asn Met Leu Val Tyr Phe Leu Val Ser Gin Ala Val Cys Gly Asn 
225 230 235 240 

Leu Leu Ala He Val Phe Ser Leu Asn His Asn Gly Met Pro Val He 
245 250 255 

Ser Lys Glu Glu Ala Val Asp Met Asp Phe Phe Thr Lys Gin He He 
260 265 270 

Thr Gly Arg Asp Val His Pro Gly Leu Phe Ala Asn Trp Phe Thr Gly 
275 280 285 
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Gly Leu Asn Tyr Gin lie Glu His His Leu Phe Pro Ser Met Pro Arg 
290 295 300 

His Asn Phe Ser Lys lie Gin Pro Ala Val Glu Thr Leu Cys Lys Lys 
305 310 315 320 

Tyr Asn Val Arg Tyr His Thr Thr Gly Met He Glu Gly Thr Ala Glu 
325 330 335 

Val Phe Ser Arg Leu Asn Glu Val Ser Lys Ala Ala Ser Lys Met Gly 
340 345 350 

Lys Ala Gin 
355 

(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS :. 

(A) LENGTH: 104 amino acids 
20 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: peptide 



30 



40 



45 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Val Thr Leu Tyr Thr Leu Ala Phe Val Ala Ala Asn Ser Leu Gly Val 
1 5 io 15 



„ Leu Tvr G1 y Val Leu Ala Cys Pro Ser Val Xaa Pro His Gin He Ala 

35 20 25 30 



Ala Gly Leu Leu Gly Leu Leu Trp He Gin Ser Ala Tyr He Gly Xaa 
35 40 45 

Asp Ser Gly His Tyr Val He Met Ser Asn Lys Ser Asn Asn Xaa Phe 
50 55 60 

Ala Gin Leu Leu Ser Gly Asn Cys Leu Thr Gly He He Ala Trp Trp 
65 70 75 80 

Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser Leu Asp Tyr 
85 90 95 

Gly Pro Asn Leu Gin His He Pro 
100 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



60 (ii) MOLECULE TYPE: peptide 



-143- 



BNSDOClD:<WO 9846764A1> 



WO 98/46764 PCT/US98/07421 



10 



20 



25 



35 



40 



50 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Gly Val Leu Tyr Gly Val Leu Ala Cys Thr Ser Val Phe Ala His Gin 
15 io is 

lie Ala Ala Ala Leu Leu Gly Leu Leu Trp lie Gin Ser Ala Tyr He 
20 25 30 

Gly His Asp Ser Gly His Tyr Val He Met Ser Asn Lys Ser Tyr Asn 
35 40 45 



Arg Phe Ala Gin Leu Leu Ser Gly Asn Cys Leu Thr Gly He Ser He 
15 50 55 60 



Ala Trp Trp Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser 
65 70 75 80 

Leu Asp Tyr Asp Pro Asp Leu Gin His He Pro Val Phe Ala Val Ser 
85 90 95 

Thr Lys Phe Phe Ser Ser Leu Thr Ser Arg Phe Tyr Asp Arg Lys Leu 
100 105 HO 

Thr Phe Gly Pro Val Ala Arg Phe Leu Val Ser Tyr Gin His Phe Thr 
115 120 125 



OA T y r T yr Pro Val Asn Cys Phe Gly Arg He Asn Leu Phe He Gin Thr 

130 135 140 



Phe Leu Leu Leu Phe Ser Lys Arg Glu Val Pro Asp Arg Ala Leu Asn 
145 150 155 160 

Phe Ala Gly He Leu Val Phe Trp Thr Trp Phe Pro Leu Leu Val Ser 
165 170 175 

Cys Leu Pro Asn Trp Pro Glu Arg Phe Phe Phe Val Phe Thr Ser Phe 
180 185 190 

Thr Val Thr Ala Leu Gin His He Gin Phe Thr Leu Asn His Phe Ala 
195 200 205 



AC Ala As P v al Tyr Val Gly Pro Pro Thr Gly Ser Asp Trp Phe Glu Lys 

45 210 215 220 



Gin Ala Ala Gly Thr He Asp He Ser Cys Arg Ser Tyr Met Asp Trp 
225 230 235 240 

Phe Phe Gly Gly Leu Gin Phe Gin Leu Glu His His 
245 250 

(2) INFORMATION FOR SEQ ID NO: 10: 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Xaa Xaa Asn Phe Ala Gly lie Leu Val Phe Trp Thr Trp Phe Pro 
1 5 10 15 

trt Leu Leu Val Ser Cys Leu Pro Asn Trp Pro Glu Arg Phe Xaa Phe Val 

10 20 25 30 



15 



20 



45 



50 



60 



Phe Thr Gly Phe Thr Val Thr Ala Leu Gin His He Gin Phe Thr Leu 
35 40 45 

Asn His Phe Ala Ala Asp Val Tyr Val Gly Pro Pro Thr Gly Ser Asp 
50 55 60 

Trp Phe Glu Lys Gin Ala Ala Gly Thr He Asp lie Ser Cys Arg Ser 
65 70 75 80 

Tyr Met Asp Trp Phe Phe Cys Gly Leu Gin Phe Gin Leu Glu His His 
85 90 95 



„ Leu Phe Pro Arg Leu Pro Arg Cys His Leu Arg Lys Val Ser Pro Val 

Z5 100 105 110 

Gly Gin Arg Gly Phe Gin Arg Lys Xaa Asn Leu Ser Xaa 
115 120 125 

30 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

35 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Pro Ala Thr Glu Val Gly Gly Leu Ala Trp Met He Thr Phe Tyr Val 

15 10 15 

Arg Phe Phe Leu Thr Tyr Val Pro Leu Leu Gly Leu Lys Ala Phe Leu 
20 25 30 

Gly Leu Phe Phe He Val Arg Phe Leu Glu Ser Asn Trp Phe Val Trp 

35 40 45 



« Val Thr Gln Met Asn His He Pro Met His He Asp His Asp Arg Asn 

^ 50 55 60 



Met Asp Trp Val Ser Thr Gin Leu Gin Ala Thr Cys Asn Val His Lys 
65 70 75 80 

Ser Ala Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gin He Glu 
85 90 95 
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His His Leu Phe Pro Thr Met Pro Arg His Asn Tyr His Xaa Val Ala 
100 105 no 

Pro Leu Val Gin Ser Leu Cys Ala Lys His Gly He Glu Tyr Gin Ser 
5 115 120 125 

Lys Pro Leu 
130 

10 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



20 



25 



30 



40 



45 



50 



55 



60 



(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Cys Ser Pro Lys Ser Ser Pro Thr Arg Asn Met Thr Pro Ser Pro Phe 
1 5 10 15 

He Asp Trp Leu Trp Gly Gly Leu Asn Tyr Gin He Glu His His Leu 
20 25 30 

Phe Pro Thr Met Pro Arg Cys Asn Leu Asn Arg Cys Met Lys Tyr Val 
35 40 45 

„ L V S Glu Tr P Cys Ala Glu Asn Asn Leu Pro Tyr Leu Val Asp Asp Tyr 

35 50 55 60 



Phe Val Gly Tyr Asn Leu Asn Leu Gin Gin Leu Lys Asn Met Ala Glu 
65 70 75 80 

Leu Val Gin Ala Lys Ala Ala 
85 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Arg His Glu Ala Ala Arg Gly Gly Thr Arg Leu Ala Tyr Met Leu Val 
15 10 15 

Cys Met Gin Trp Thr Asp Leu Leu Trp Ala Ala Ser Phe Tyr Ser Arg 
20 25 30 
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Phe Phe Leu Ser Tyr Ser Pro Phe Tyr Gly Ala Thr Gly Thr Leu Leu 
35 40 45 

Leu Phe Val Ala Val Arg' Val Leu Glu Ser His Trp Phe Val Trp lie 
50 55 60 

Thr Gin Met Asn His lie Pro Lys Glu He Gly His Glu Lys His Arg 
65 70 75 80 

Asp Trp Ala Ser Ser Gin Leu Ala Ala Thr Cys Asn Val Glu Pro Ser 
85 90 95 



Leu phe Ile As P Trp Phe Ser Gly His Leu Asn Phe Gin He Glu His 
ib 10 ° 105 110 



His Leu Phe Pro Thr Met Thr Arg His Asn Tyr Arg Xaa Val Ala Pro 
115 120 125 

Leu Val Lys Ala Phe Cys Ala Lys His Gly Leu His Tyr Glu Val 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 14: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Leu His His Thr Tyr Thr Asn Ile Ala Gly Ala Asp Pro Asp Val Ser 
15 10 15 

Thr Ser Glu Pro Asp Val Arg Arg lie Lys Pro Asn Gin Lys Trp Phe 
20 25 30 

Val Asn His Ile Asn Gin His Met Phe Val Pro Phe Leu Tyr Gly Leu 
35 40 45 

Leu Ala Phe Lys Val Arg Ile Gin Asp Ile Asn Ile Leu Tyr Phe Val 
50 55 60 

Lys Thr Asn Asp Ala Ile Arg Val Asn Pro Ile Ser Thr Trp His Thr 
65 70 75 80 

Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu Ile 
85 90 95 

Val Pro Leu Gin Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe Thr 
100 105 no 

Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gin Ala 
115 120 125 
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Asn Tyr Val Val Glu Glu Val Gin Trp Pro Leu Pro Asp Glu Asn Gly 
130 135 140 

He He Gin Lys Asp Trp Ala Ala Met Gin Val Glu Thr Thr Gin Asp 
5 145 150 155 160 

Tyr Ala His Asp Ser His Leu Trp Thr Ser He Thr Gly Ser Leu Asn 
165 170 175 

10 Tyr Gin Xaa Val His His Leu Phe Pro His 

180 185 

(2) INFORMATION FOR SEQ ID NO: 15: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



20 



25 



30 



40 



45 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

His Xaa Xaa His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ala Ala Gin He Lys Lys Tyr He Thr Ser Asp Glu Leu Lys Asn 
15 10 15 

<- n His As P Lvs Pro Gly Asp Leu Trp He Ser He Gin Gly Lys Ala Tyr 

DU 20 25 30 

Asp Val Ser Asp Trp Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu 
35 40 45 

55 Lvs Ser Leu Ala Gly Gin Glu Val Thr Asp Ala Phe Val Ala Phe His 

50 55 60 

Pro Ala Ser Thr Trp Lys Asn Leu Asp Lys Phe Phe Thr Glv Tvr Tvr 

60 " 70 75 so 

Leu Lys Asp Tyr Ser Val Ser Glu Val Ser Lys Val Tyr Arg Lys Leu 
85 90 95 
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Val Phe Glu Phe Ser Lys Met Gly Leu Tyr Asp Lys Lys Gly His lie 
100 105 HO 

Met Phe Ala Thr Leu Cys Phe He Ala Met Leu Phe Ala Met Ser Val 
115 120 125 

Tyr Gly Val Leu Phe Cys Glu Gly Val Leu Val His Leu Phe Ser Gly 
130 135 140 

Cys Leu Met Gly Phe Leu Trp He Gin Ser Gly Trp He Gly His Asp 
14 5 150 155 160 



Ala G ly His Tyr Met Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met 
15 165 170 175 



Gly He Phe Ala Ala Asn Cys Leu Ser Gly He Ser He Gly Trp Trp 
180 185 190 

Lys Trp Asn His Asn Ala His His He Ala Cys Asn Ser Leu Glu Tyr 
195 200 205 

Asp Pro Asp Leu Gin Tyr He Pro Phe Leu Val Val Ser Ser Lys Phe 
210 215 220 

Phe Gly Ser Leu Thr Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp 
225 230 235 240 

Ser Leu Ser Arg Phe Phe Val Ser Tyr Gin His Trp Thr Phe Tyr Pro 
245 250 255 

He Met Cys Ala Ala Arg Leu Asn Met Tyr Val Gin Ser Leu He Met 
260 265 270 

Leu Leu Thr Lys Arg Asn Val Ser Tyr Arg Ala Gin Glu Leu Leu Gly 
275 280 285 

Cys Leu Val Phe Ser He Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro 
290 295 300 

Asn Trp Gly Glu Arg He Met Phe Val He Ala Ser Leu Ser Val Thr 
305 310 315 320 

Gly Met Gin Gin Val Gin Phe Ser Leu Asn His Phe Ser Ser Ser Val 
325 330 335 

Tyr Val Gly Lys Pro Lys Gly Asn Asn Trp Phe Glu Lys Gin Thr Asp 
340 345 350 

Gly Thr Leu Asp He Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly 
3 55 360 365 

Gly Leu Gin Phe Gin He Glu His His Leu Phe Pro Lys Met Pro Arg 
37 0 375 380 

Cys Asn Leu Arg Lys He Ser Pro Tyr Val He Glu Leu Cys Lys Lys 
385 390 395 400 

His Asn Leu Pro Tyr Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met 
405 4X0 415 
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Thr Leu Arg Thr Leu Arg Asn Thr Ala Leu Gin Ala Arg Asp He Thr 
420 425 430 

Lys Pro Leu Pro Lys Asn Leu Val Trp Glu Ala Leu His Thr 
5 435 440 445 

(2) INFORMATION FOR SEQ ID NO: 17 : 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: peptide 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Leu Thr Ala Glu Arg He Lys Phe Thr Gin Lys Arg Gly Phe Arg 
1 5 in i c 



25 



30 



5 10 15 

Arg Val Leu Asn Gin Arg Val Asp Ala Tyr Phe Ala Glu His Gly Leu 
20 25 30 

Thr Gin Arg Asp Asn Pro Ser Met Tyr Leu Lys Thr Leu He He Val 
35 40 45 

Leu Trp Leu Phe Ser Ala Trp Ala Phe Val Leu Phe Ala Pro Val He 
50 55 60 

~- Phe Pro Val Arg Leu Leu Gly Cys Met Val Leu Ala He Ala Leu Ala 

33 65 70 75 80 

Ala Phe Ser Phe Asn Val Gly His Asp Ala Asn His Asn Ala Tyr Ser 
85 90 95 

40 Ser Asn Pro His He Asn Arg Val Leu Gly Met Thr Tyr Asp Phe Val 

100 105 110 

Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg His Asn Tyr Leu His His 
115 120 125 

Thr Tyr Thr Asn He Leu Gly His Asp Val Glu He His Gly Asp Gly 
130 135 140 

<- A Ala Val Ar 9 Met Ser Pro Glu Gin Glu His Val Gly He Tyr Arg Phe 

50 145 150 155 160 

Gin Gin Phe Tyr He Trp Gly Leu Tyr Leu Phe He Pro Phe Tyr Trp 
165 170 175 

Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn Lys Gly Lys Tyr His Asp 
180 i 8 5 * !9o 

His Lys He Pro Pro Phe Gin Pro Leu Glu Leu Ala Ser Leu Leu Gly 
I 95 200 205 

He Lys Leu Leu Trp Leu Gly Tyr Val Phe Gly Leu Pro Leu Ala Leu 
21° 215 220 



45 



55 
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Gly Phe Ser lie Pro Glu Val Leu lie Gly Ala Ser Val Thr Tyr Met 
225 230 235 240 

Thr Tyr Gly lie Val Val Cys Thr He Phe Met Leu Ala His Val Leu 
2 45 250 255 

Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly Glu Ser Gly Ala He Asp 
260 265 270 

Asp Glu Trp Ala He Cys Gin He Arg Thr Thr Ala Asn Phe Ala Thr 
275 280 285 



t . Asn Asn Phe Trp Asn Trp Phe Cys Gly Gly Leu Asn His Gin Val 

15 290 295 300 



Thr His His Leu Phe Pro Asn He Cys His He His Tyr Pro Gin Leu 
305 310 315 320 

Glu Asn He He Lys Asp Val Cys Gin Glu Phe Gly Val Glu Tyr Lys 
325 330 335 

Val Tyr Pro Thr Phe Lys Ala Ala He Ala Ser Asn Tyr Arg Trp Leu 
340 345 350 

Glu Ala Met Gly Lys Ala Ser 
355 

(2) INFORMATION FOR SEQ ID NO: 18 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Thr Ser Thr Thr Ser Lys Val Thr Phe Gly Lys Ser He Gly Phe 
15 10 15 

Arg Lys Glu Leu Asn Arg Arg Val Asn Ala Tyr Leu Glu Ala Glu Asn 
20 25 30 

He Ser Pro Arg Asp Asn Pro Pro Met Tyr Leu Lys Thr Ala He He 
35 40 45 

Leu Ala Trp Val Val Ser Ala Trp Thr Phe Val Val Phe Gly Pro Asp 
SO 55 60 

Val Leu Trp Met Lys Leu Leu Gly Cys He Val Leu Gly Phe Gly Val 
65 70 75 80 

Ser Ala Val Gly Ph Asn He Ser His Asp Gly Asn His Gly Gly Tyr 
85 90 95 
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Ser Lys Tyr Gin Trp Val Asn Tyr Leu Ser Gly Leu Thr His Asp Ala 
100 105 no 

He Gly Val Ser Ser Tyr Leu Trp Lys Phe Arg His Asn Val Leu His 
115 120 125 

His Thr Tyr Thr Asn He Leu Gly His Asp Val Glu He His Gly Asp 
130 135 140 

Glu Leu Val Arg Met Ser Pro Ser Met Glu Tyr Arg Trp Tyr His Arg 
I 45 150 155 160 

Tyr Gin His Trp Phe He Trp Phe Val Tyr Pro Phe He Pro Tyr Tyr 
165 170 175 

Trp Ser He Ala Asp Val Gin Thr Met Leu Phe Lys Arg Gin Tyr His 
180 185 190 



~ A As P His Glu Ile Pr ° Ser Pro Thr Trp Val Asp He Ala Thr Leu Leu 

20 195 200 205 



Ala Phe Lys Ala Phe Gly Val Ala Val Phe Leu Ile Ile Pro lie Ala 
210 215 220 

Val Gly Tyr Ser Pro Leu Glu Ala Val Ile Gly Ala Ser Ile Val Tyr 
225 230 235 240 

Met Thr His Gly Leu Val Ala Cys Val Val Phe Met Leu Ala His Val 
2 45 250 255 

Ile Glu Pro Ala Glu Phe Leu Asp Pro Asp Asn Leu His Ile Asp Asp 
260 265 270 



Glu Trp Ala He Ala Gin Val Lys Thr Thr Val Asp Phe Ala Pro Asn 
^3 275 280 285 



Asn Thr Ile He Asn Trp Tyr Val Gly Gly Leu Asn Tyr Gin Thr Val 
290 295 300 

His His Leu Phe Pro His Ile Cys His Ile His Tyr Pro Lys lie Ala 
305 310 315 320 

Pro He Leu Ala Glu Val Cys Glu Glu Phe Gly Val Asn Tyr Ala Val 
325 330 335 

His Gin Thr Phe Phe Gly Ala Leu Ala Ala Asn Tyr Ser Trp Leu Lys 
340 345 350 



- n L y s Met Ser He Asn Pro Glu Thr Lys Ala Ile Glu Gin 

50 355 360 365 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCAAGCTTCT GCAGGAGCTC TTTTTTTTTT TTTTT 35 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "Synthetic oligonucleotide" 

(ix) FEATURE: 
20 (A) NAME /KEY : misc_feature 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /number* 1 
/note= "N^Inosine or Cytosine" 

25 (ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION: 27 

(D) OTHER INFORMATION: /number= 2 
/note* "N=lnosine or Cytosine" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CUACUACUAC UACAYCAYAC NTAYACNAAY AT 32 
(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
45 (A) DESCRIPTION: /desc « "Synthetic oligonucleotide' 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 
50 (B) LOCATION: 13 

(D) OTHER INFORMATION: /number* 1 
/note* "N*Inosine or Cytosine" 

(ix) FEATURE: 
55 (A) NAME /KEY : misc_f eature 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /number* 2 
/note= "N=Inosine or Cytosine" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
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CAUCAUCAUC AUNGGRAANA RRTGRTG 27 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CUACUACUAC UAGGAGTCCT CTACGGTGTT TTG 33 
20 (2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
35 CAUCAUCAUC AUATGATGCT CAAGCTGAAA CTG 33 

(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Gin Xaa Xaa His His 
1 5 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CUACUACUAC UACTCGAGCA AGATGGGAAC GGACCAAGG 39 
10 (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



55 



60 



(it) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

25 CAUCAUCAUC AUCTCGAGCT ACTCTTCCTT GGGACGGAG 39 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: other nucleic acid 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CUACUACUAC UATCTAGACT CGAGACCATG GCTGCTGCTC CAGTGTG 47 
(2) INFORMATION FOR SEQ ID NO: 28: 

45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CAUCAUCAUC AUAGGCCTCG AGTTACTGCG CCTTACCCAT 4 0 
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(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



CUACUACUA CUAGGATCCA TGGCACCTCC CAACACT 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



CAUCAUCAU CAUGGTACCT CGAGTTACTT CTTGAAAAAG AC 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2692004) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



GCACGCCGAC 


CGGCGCCGGG 


AGATCCTGGC 


AAAGTATCCA 


GAGATAAAGT 


CCTTGATGAA 


ACCTGATCCC 


AATTTGATAT 


GGATTATAAT 


TATGATGGTT 


CTCACCCAGT 


TGGGTGCATT 


TTACATAGTA 


AAAGACTTGG 


ACTGGAAATG 


GGTCATATTT 


GGGGCCTATG 


CGTTTGGCAG 


TTGCATTAAC 


CACTCAATGA 


CTCTGGCTAT 


TCATGAGATT 


GCCCACAATG 


CTGCCTTTGG 


CAACTGCAAA 


GCAATGTGGA 


ATCGCTGGTT 


TGGAATGTTT 


GCTAATCTTC 


CTATTGGGAT 


TCCATATTCA 


ATTTCCTTTA 


AGAGGTATCA 


CATGGATCAT 


CATCGGTACC 


TTGGAGCTGA 


TGGCGTCGAT 


GTAGATATTC 


CTACCGATTT 


TGAGGGCTGG 


TTCTTCTGTA 


CCGCTTTCAG 


AAAGTTTATA 


TGGGTTATTC 


TTCAGCCTCT 


CTTTTATGCC 


TTTCGACCTC 


TGTTCATCAA 
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10 



15 



20 



25 



30 



CCCCAAACCA 


ATTACGTATC 


TGGAAGTTAT 


CAATACCGTG 


GCACAGGTCA 


CTTTTGACAT 


540 


TTTAATTTAT 


TACTTTTTGG 


GAATTAAATC 


CTTAGTCTAC 


ATGTTGGCAG 


CATCTTTACT 


600 


TGGCCTGGGT 


TTGCACCCAA 


TTTCTGGACA 


TTTTATAGCT 


GAGCATTACA 


TGTTCTTAAA 


660 


GGGTCATGAA 


ACTTACTCAT 


ATTATGGGCC 


TCTGAATTTA 


CTTACCTTCA 


ATGTGGGTTA 


720 


TCATAATGAA 


CATCATGATT 


TCCCCAACAT 


TCCTGGAAAA 


AGTCTTCCAC 


TGGTGAGGAA 


780 


AATAGCAGCT 


GAATACTATG 


ACAACCTCCC 


TCACTACAAT 


TCCTGGATAA 


AAGTACTGTA 


840 


TGATTTTGTG 


ATGGATGATA 


CAATAAGTCC 


CTACTCAAGA 


ATGAAGAGGC 


ACCAAAAAGG 


900 


AGAGATGGTft 


V- l oorto J. nnn 


1 A 1 UA I 1 AtjfT 


GCCAAAGGGA 


TTCTTCTCCA 


AAACTTTAGA 


960 


TGATAAAATG 


GAATTTTTGC 


ATTATTAAAC 


TTGAGACCAG 


TGATGCTCAG 


AAGCTCCCCT 


1020 


GGCACAATTT 


CAGAGTAAGA 


GCTCGGTGAT 


ACCAAGAAGT 


GAATCTGGCT 


TTTAAACAGT 


1080 


CAGCCTGACT 


CTGTACTGCT 


CAGTTTCACT 


CACAGGAAAC 


TTGTGACTTG 


TGTATTATCG 


1140 


TCATTGAGGA 


TGTTTCACTC 


ATGTCTGTCA 


TTTTATAAGC 


ATATCATTTA 


AAAAGCTTCT 


1200 


AAAAAG CT AT 


TTCGCCAGG 










1219 



(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2153526) 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



45 


TTACCTTCTA 


CGTCCGCTTC 


TTCCTCACTT 


ATGTGCCACT 


ATTGGGGCTG 


AAAGCTTCCT 


60 




GGGCCTTTTC 


TTCATAGTCA 


GGTTCCTGGA 


AAGCAACTGG 


TTTGTGTGGG 


TGACACAGAT 


120 




GAACCATATT 


CCCATGCACA 


TTGATCATGA 


CCGGAACATG 


GACTGGGTTT 


CCACCCAGCT 


180 


50 


CCAGGCCACA 


TGCAATGTCC 


ACAAGTCTGC 


CTTCAATGAC 


TGGTTCAGTG 


GACACCTCAA 


240 




CTTCCAGATT 


GAGCACCATC 


TTTTTCCCAC 


GATGCCTCGA 


CACAATTACC 


ACAAAGTGGC 


300 


55 


TCCCCTGGTG 


CAGTCCTTGT 


GTGCCAAGCA 


TGGCATAGAG 


TACCAGTCCA 


AGCCCCTGCT 


360 




GTCAGCCTTC 


GCCGACATCA 


TCCACTCACT 


AAAGGAGTCA 


GGGCAGCTCT 


GGCTAGATGC 


420 




CTATCTTCAC 


CAATAACAAC 


AGCCACCCTG 


CCCAGTCTGG 


AAGAAGAGGA 


GGAAGACTCT 


480 


60 


GGAGCCAAGG 


CAGAGGGGAG 


CTTGAGGGAC 


AATGCCACTA 


TAGTTTAATA 


CTCAGAGGGG 


540 




GTTGGGTTTG 


GGGACATAAA 


GCCTCTGACT 


CAAACTCCTC 


CCTTTTATCT 


TCTAGCCACA 


600 
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GTTCTAAGAC CCAAAGTGGG GGGTGGACAC AGAAGTCCCT AGGAGGGAAG GAGCT 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3506132) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 
TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 
CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 
AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 
CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 
AAGA 

(2) INFORMATION FOR SEQ ID NO: 34: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3854933) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



CAGGGACCTA 


CCCCGCGCTA 


CTTCACCTGG 


GACGAGGTGG 


CCCAGCGCTC 


AGGGTGCGAG 


GAGCGGTGGC 


TAGTGATCGA 


CCGTAAGGTG 


TACAACATCA 


GCGAGTTCAC 


CCGCCGGCAT 


CCAGGGGGCT 


CCCGGGTCAT 


CAGCCACTAC 


GCCGGGCAGG 


ATGCCACGGA 


TCCCTTTGTG 


GCCTTCCACA 


TCAACAAGGG 


CCTTGTGAAG 


AAGTATATGA 


ACTCTCTCCT 


GATTGGAGAA 


CTGTCTCCAG 


AGCAGCCCAG 


CTTTGAGCCC 


ACCAAGAATA 


AAGAGCTGAC 


AGATGAGTTC 


CGGGAGCTGC 


GGGCCACAGT 


GGAGCGGATG 


GGGCTCATGA 


AGGCCAACCA 


TGTCTTCTTC 


CTGCTGTACC 


TGCTGCACAT 


CTTGCTGCTG 


GATGGTGCAG 


CCTGGCTCAC 


CCTTTGGGTC 


TTTGGGACGT 


CCTTTTTGCC 


CTTCCTCCTC 


TGTGCGGTGC 


TGCTCAGTGC 


AGTTCAGGCC 


CAGGCTGGCT 


GGCTGCAGCA 


TGACTTTGGG 


CACCTGTCGG 


TCTTCAGCAC 


CTCAAAGTGG 


AACCATCTGC 


TACATCATTT 


TGTGATTGGC 


CACCTGAAGG 


GGGCCCCCGC 


CAGTTGGTGG 


AACCACATGC 


ACTTCCAGCA 


CCATGCCAAG 


CCCAACTGCT 


TCCGCAAAGA 


CCCAGACATC 
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AACATGCATC CCTTCTTCTT TGCCTTGGGG AAGATCCTCT CTGTGGAGCT TGGGAAACAG 720 

AAGAAAAAAT ATATGCCGTA CAACCACCAG CACARATACT TCTTCCTAAT TGGGCCCCCA 7 80 

GCCTTGCTGC CTCTCTACTT CCAGTGGTAT ATTTTCTATT TTGTTATCCA GCGAAAGAAG 840 

TGGGTGGACT TGGCCTGGAT CAGCAAACAG GAATACGATG AAGCCGGGCT TCCATTGTCC 900 

10 ACCGCAAATG CTTCTAAA 



(2) INFORMATION FOR SEQ ID NO: 35: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



25 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2511785) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
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918 





GCCACTTAAA 


GGGTGCCTCT 


GCCAACTGGT 


GGAATCATCG 


CCACTTCCAG 


CACCACGCCA 


60 




AGCCTAACAT 


CTTCCACAAG 


GATCCCGATG 


TGAACATGCT 


GCACGTGTTT 


GTTCTGGGCG 


120 


30 


AATGGCAGCC 


CATCGAGTAC 


GGCAAGAAGA 


AGCTGAAATA 


CCTGCCCTAC 


AATCACCAGC 


180 




ACGAATACTT 


CTTCCTGATT 


GGGCCGCCGC 


TGCTCATCCC 


CATGTATTTC 


CAGTACCAGA 


240 


35 


TCATCATGAC 


CATGATCGTC 


CATAAGAACT 


GGGTGGACCT 


GGCCTGGGCC 


GTCAGCTACT 


300 




ACATCCGGTT 


CTTCATCACC 


TACATCCCTT 


TCTACGGCAT 


CCTGGGAGCC 


CTCCTTTTCC 


360 




TCAACTTCAT 


CAGGTTCCTG 


GAGAGCCACT 


GGTTTGTGTG 


GGTCACACAG 


ATGAATCACA 


420 


40 


TCGTCATGGA 


GATTGACCAG 


GAGGCCTACC 


GTGACTGGTT 


CAGTAGCCAG 


CTGACAGCCA 


480 




CCTGCAACGT 


GGAGCAGTCC 


TTCTTCAACG 


ACTGGTTCAG 


TGGACACCTT 


AACTTCCAGA 


540 


45 


TTGAGCACCA 


CCTCTTCCCC 


ACCATGCCCC 


GGCACAACTT 


ACACAAGATC 


GCCCCGCTGG 


600 




TGAAGTCTCT 


ATGTGCCAAG 


CATGGCATTG 


AATACCAGGA 


GAAGCCGCTA 


CTGAGGGCCC 


660 




TGCTGGACAT 


CATCAGGTCC 


CTGAAGAAGT 


CTGGGAAGCT 


GTGGCTGGAC 


GCCTACCTTC 


720 


50 


ACAAATGAAG 


CCACAGCCCC 


CGGGACACCG 


TGGGGAAGGG 


GTGCAGGTGG 


GGTGATGGCC 


780 




AGAGGAATGA 


TGGGCTTTTG 


TTCTGAGGGG 


TGTCCGAGAG 


GCTGGTGTAT 


GCACTGCTCA 


840 


55 


CGGACCCCAT 


GTTGGATCTT 


TCTCCCTTTC 


TCCTCTCCTT 


TTTCTCTTCA 


CATCTCCCCC 


900 




ATAGCACCCT 


GCCCTCATGG 


GACCTGCCCT 


CCCTCAGCCG 


TCAGCCATCA 


GCCATGGCCC 


960 




TCCCAGTGCC 


TCCTAGCCCC 


TTCTTCCAAG 


GAGCAGAGAG 


GTGGCCACCG 


GGGGTGGCTC 


1020 


60 


TGTCCTACCT 


CCACTCTCTG 


CCCCTAAAGA 


TGGGAGGAGA 


CCAGCGGTCC 


ATGGGTCTGG 


1080 




CCTGTGAGTC 


TCCCCTTGCA 


GCCTGGTCAC 


TAGGCATCAC 


CCCCGCTTTG 


GTTCTTCAGA 


1140 
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10 



TGCTCTTGGG 


GTTCATAGGG 


GCAGGTCCTA 


GTCGGGCAGG 


GCCCCTGACC 


CTCCCGGCCT 


1200 


GGCTTCACTC 


TCCCTGACGG 


CTGCCATTGG 


TCCACCCTTT 


CATAGAGAGG 


CCTGCTTTGT 


1260 


TACAAAGCTC 


GGGTCTCCCT 


CCTGCAGCTC 


GGTTAAGTAC 


CCGAGGCCTC 


TCTTAAGATG 


1320 


TCCAGGGCCC 


CAGGCCCGCG 


GGCACAGCCA 


GCCCAAACCT 


TGGGCCCTGG 


AAGAGTCCTC 


1380 


CACCCCATCA 


CTAGAGTGCT 


CTGACCCTGG 




CaCCCCATTCC 


n m 

ACCGCCTCCC 


1440 


CAACTTGAGC 


CTGTGACCTT 


GGGACCAAAG 


GGGGAGTCCC 


TCGTCTCTTG 


TGACTCAGCA 


1500 


GAGGCAGTGG 


CCACGTTCAG 


GGAGGGGCCG 


GCTGGCCTGG 


AGGCTCAGCC 


CACCCTCCAG 


1560 


CTTTTCCTCA 


GGGTGTCCTG 


AGGTCCAAGA 


TTCTGGAGCA 


ATCTGACCCT 


TCTCCAAAGG 


1620 


CTCTGTTATC 


AGCTGGGCAG 


TGCCAGCCAA 


TCCCTGGCCA 


TTTGGCCCCA 


GGGGACGTGG 


1680 


GCCCTG 












1686 



(2) INFORMATION FOR SEQ ID NO: 36: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: other nucleic acid (Contig 2535) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



35 





GTCTTTTACT 


TTGGCAATGG 


CTGGATTCCT 


ACCCTCATCA 


CGGCCTTTGT 


CCTTGCTACC 


60 




TCTCAGGCCC 


AAGCTGGATG 


GCTGCAACAT 


GATTATGGCC 


ACCTGTCTGT 


CTACAGAAAA 


120 


40 


CCCAAGTGGA 


ACCACCTTGT 


CCACAAATTC 


GTCATTGGCC 


ACTTAAAGGG 


TGCCTCTGCC 


180 




AACTGGTGGA 


ATCATCGCCA 


CTTCCAGCAC 


CACGCCAAGC 


CTAACATCTT 


CCACAAGGAT 


240 


45 


CCCGATGTGA 


ACATGCTGCA 


CGTGTTTGTT 


CTGGGCGAAT 


GGCAGCCCAT 


CGAGTACGGC 


300 




AAGAAGAAGC 


TGAAATACCT 


GCCCTACAAT 


CACCAGCACG 


AATACTTCTT 


CCTGATTGGG 


360 




CCGCCGCTGC 


TCATCCCCAT 


GTATTTCCAG 


TACCAGATCA 


TCATGACCAT 


GATCGTCCAT 


420 


50 


AAGAACTGGG 


TGGACCTGGC 


CTGGGCCGTC 


AGCTACTACA 


TCCGGTTCTT 


CATCACCTAC 


480 




ATCCCTTTCT 


ACGGCATCCT 


GGGAGCCCTC 


CTTTTCCTCA 


ACTTCATCAG 


GTTCCTGGAG 


540 


55 


AGCCACTGGT 


TTGTGTGGGT 


CACACAGATG 


AATCACATCG 


TCATGGAGAT 


TGACCAGGAG 


600 




GCCTACCGTG 


ACTGGTTCAG 


TAGCCAGCTG 


ACAGCCACCT 


GCAACGTGGA 


GCAGTCCTTC 


660 




TTCAACGACT 


GGTTCAGTGG 


ACACCTTAAC 


TTCCAGATTG 


AGCACCACCT 


CTTCCCCACC 


720 


60 


ATGCCCCGGC 


ACAACTTACA 


CAAGATCGCC 


CCGCTGGTGA 


AGTCTCTATG 


TGCCAAGCAT 


780 




GGCATTGAAT 


ACCAGGAGAA 


GCCGCTACTG 


AGGGCCCTGC 


TGGACATCAT 


CAGGTCCCTG 


840 
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AAGAAGTCTG GGAAGCTGTG GCTGGACGCC TACCTTCACA AATGAAGCCA CAGCCCCCGG 900 

GACACCGTGG GGAAGGGGTG CAGGTGGGGT GATGGCCAGA GGAATGATGG GCTTTTGTTC 960 

TGAGGGGTGT CCGAGAGGCT GGTGTATGCA CTGCTCACGG ACCCCATGTT GGATCTTTCT 1020 

CCCTTTCTCC TCTCCTTTTT CTCTTCACAT CTCCCCCATA GCACCCTGCC CTCATGGGAC 1080 

10 CTGCCCTCCC TCAGCCGTCA GCCATCAGCC ATGGCCCTCC CAGTGCCTCC TAGCCCCTTC 1140 

TTCCAAGGAG CAGAGAGGTG GCCACCGGGG GTGGCTCTGT CCTACCTCCA CTCTCTGCCC 1200 

CTAAAGATGG GAGGAGACCA GCGGTCCATG GGTCTGGCCT GTGAGTCTCC CCTTGCAGCC 12 60 

TGGTCACTAG GCATCACCCC CGCTTTGGTT CTTCAGATGC TCTTGGGGTT CATAGGGGCA 1320 

GGTCCTAGTC GGGCAGGGCC CCTGACCCTC CCGGCCTGGC TTCACTCTCC CTGACGGCTG 1380 

20 CCATTGGTCC ACCCTTTCAT AGAGAGGCCT GCTTTGTTAC AAAGCTCGGG TCTCCCTCCT 1440 

GCAGCTCGGT TAAGTACCCG AGGCCTCTCT TAAGATGTCC AGGGCCCCAG GCCCGCGGGC 1500 

^ ACAGCCAGCC CAAACCTTGG GCCCTGGAAG AGTCCTCCAC CCCATCACTA GAGTGCTCTG 1560 

ACCCTGGGCT TTCACGGGCC CCATTCCACC GCCTCCCCAA CTTGAGCCTG TGACCTTGGG 1620 

ACCAAAGGGG GAGTCCCTCG TCTCTTGTGA CTCAGCAGAG GCAGTGGCCA CGTTCAGGGA 1680 

30 GGGGCCGGCT GGCCTGGAGG CTCAGCCCAC CCTCCAGCTT TTCCTCAGGG TGTCCTGAGG 17 40 

TCCAAGATTC TGGAGCAATC TGACCCTTCT CCAAAGGCTC TGTTATCAGC TGGGCAGTGC 1800 

CAGCCAATCC CTGGCCATTT GGCCCCAGGG GACGTGGGCC CTG 1843 



35 



60 



(2) INFORMATION FOR SEQ ID NO: 37; 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

45 <ii) MOLECULE TYPE: other nucleic acid (Edited Contig 253538a) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

^ CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG 60 

GAGCGGTGGC TAGTGATCGA CCGTAAGGTG TACAACATCA GCGAGTTCAC CCGCCGGCAT 120 

CCAGGGGGCT CCCGGGTCAT CAGCCACTAC GCCGGGCAGG ATGCCACGGA TCCCTTTGTG 180 

55 GCCTTCCACA TCAACAAGGG CCTTGTGAAG AAGTATATGA ACTCTCTCCT GATTGGAGAA 240 

CTGTCTCCAG AGCAGCCCAG CTTTGAGCCC ACCAAGAATA AAGAGCTGAC AGATGAGTTC 300 

CGGGAGCTGC GGGCCACAGT GGAGCGGATG GGGCTCATGA AGGCCAACCA TGTCTTCTTC 360 

CTGCTGTACC TGCTGCACAT CTTGCTGCTG GATGGTGCAG CCTGGCTCAC CCTTTGGGTC 420 
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TTTGGGACGT 


CCTTTTTGCC 


CTTCCTCCTC 




GCCCAAGCTG 


GATGGCTGCA 


ACATGATTAT 


5 


TGGAACCACC 


TTGTCCACAA 


ATTCGTCATT 




TGGAATCATC 


GCCACTTCCA 


GCACCACGCC 


10 


GTGAACATGC 


TGCACGTGTT 


TGTTCTGGGC 


AAGCTGAAAT 


ACCTGCCCTA 


CAATCACCAG 




CTGCTCATCC 


CCATGTATTT 


CCAGTACCAG 


15 


TGGGTGGACC 


TGGCCTGGGC 


CGTCAGCTAC 




TTCTACGGCA 


TCCTGGGAGC 


CCTCCTTTTC 


20 


TGGTTTGTGT 


GGGTCACACA 


GATGAATCAC 


CGTGACTGGT 


TCAGTAGCCA 


GCTGACAGCC 




GACTGGTTCA 


GTGGACACCT 


TAACTTCCAG 


25 


CGGCACAACT 


TACACAAGAT 


CGCCCCGCTG 




GAATACCAGG 


AGAAGCCGCT 


ACTGAGGGCC 


30 


TCTGGGAAGC 


TGTGGCTGGA 


CGCCTACCTT 


GTGGGGAAGG 


GGTGCAGGTG 


GGGTGATGGC 




GTGTCCGAGA 


GGCTGGTGTA 


TGCACTGCTC 


35 


CTCCTCTCCT 


TTTTCTCTTC 


ACATCTCCCC 




TCCCTCAGCC 


GTCAGCCATC 


AGCCATGGCC 


40 


GGAGCAGAGA 


GGTGGCCACC 


GGGGGTGGCT 


ATGGGAGGAG 


ACCAGCGGTC 


CATGGGTCTG 




CTAGGCATCA 


CCCCCGCTTT 


GGTTCTTCAG 


45 


AGTCGGGCAG 


GGCCCCTGAC 


CCTCCCGGCC 




GTCCACCCTT 


TCATAGAGAG 


GCCTGCTTTG 


50 


CGGTTAAGTA 


CCCGAGGCCT 


CTCTTAAGAT 


AGCCCAAACC 


TTGGGCCCTG 


GAAGAGTCCT 




GGCTTTCACG 


GGCCCCATTC 


CACCGCCTCC 


55 


GGGGGAGTCC 


CTCGTCTCTT 


GTGACTCAGC 




GGCTGGCCTG 


GAGGCTCAGC 


CCACCCTCCA 


60 


ATTCTGGAGC 


AATCTGACCC 


TTCTCCAAAG 


ATCCCTGGCC 


ATTTGGCCCC 


AGGGGACGTG 



TGTGCGGTGC 


TGCTCAGTGC 


AGTTCAGCAG 


480 


GGCCACCTGT 


CTGTCTACAG 


AAAACCCAAG 


540 


GGCCACTTAA 


AGGGTGCCTC 


TGCCAACTGG 


600 


AAGCCTAACA 


TCTTCCACAA 


GGATCCCGAT 


660 


GAATGGCAGC 


CCATCGAGTA 


CGGCAAGAAG 


720 


CACGAATACT 


TCTTCCTGAT 


TGGGCCGCCG 


780 


ATCATCATGA 


CCATGATCGT 


CCATAAGAAC 


840 


TACATCCGGT 


TCTTCATCAC 


CTACATCCCT 


900 


CTCAACTTCA 


TCAGGTTCCT 


GGAGAGCCAC 


960 


ATCGTCATGG 


AGATTGACCA 


GGAGGCCTAC 


1020 


ACCTGCAACG 


TGGAGCAGTC 


CTTCTTCAAC 


1080 


ATTGAGCACC 


ACCTCTTCCC 


CACCATGCCC 


1140 


GTGAAGTCTC 


TATGTGCCAA 


GCATGGCATT 


1200 


CTGCTGGACA 


TCATCAGGTC 


CCTGAAGAAG 


1260 


CACAAATGAA 


GCCACAGCCC 


CCGGGACACC 


1320 


CAGAGGAATG 


ATGGGCTTTT 


GTTCTGAGGG 


1380 


ACGGACCCCA 


TGTTGGATCT 


TTCTCCCTTT 


1440 


CATAGCACCC 


TGCCCTCATG 


GGACCTGCCC 


1500 


CTCCCAGTGC 


CTCCTAGCCC 


CTTCTTCCAA 


1560 


CTGTCCTACC 


TCCACTCTCT 


GCCCCTAAAG 


1620 


GCCTGTGAGT 


CTCCCCTTGC 


AGCCTGGTCA 


1680 


ATGCTCTTGG 


GGTTCATAGG 


GGCAGGTCCT 


1740 


TGGCTTCACT 


CTCCCTGACG 


GCTGCCATTG 


1800 


TTACAAAGCT 


CGGGTCTCCC 


TCCTGCAGCT 


1860 


GTCCAGGGCC 


CCAGGCCCGC 


GGGCACAGCC 


1920 


CCACCCCATC 


ACTAGAGTGC 


TCTGACCCTG 


1980 


CCAACTTGAG 


CCTGTGACCT 


TGGGACCAAA 


2040 


AGAGGCAGTG 


GCCACGTTCA 


GGGAGGGGCC 


2100 


GCTTTTCCTC 


AGGGTGTCCT 


GAGGTCCAAG 


2160 


GCTCTGTTAT 


CAGCTGGGCA 


GTGCCAGCCA 


2220 


GGCCCTG 






2257 
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(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2692004) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



His 


Ala 




Arg 


Arg 


Arg 


(Z 1 n 
ulU 


Tift 

lie 


Leu 


Ala 


Lys 


Tyr 


Pro 


Glu 


He 


1 








•J 










1U 










15 


Lys 


Ser 


Leu 


Met 


Lys 
20 


Pro 


Asp 


Pro 


Asn 


Leu 


lie 


Trp 


He 


He 


He 

JO 


Met 


Met 


Val 


Leu 


35 




Leu 


uiy 


Ala 


Pne 
40 


Tyr 


He 


Val 


Lys 


Asp 
45 


Leu 


Asp 


Trn 

il r J 


Lys 


T m 


Val 


Tip 
X Xt5 


trne 


uiy 


nla 


Tyr 


Ala 


n w a 

Pne 


Gly 


Ser 










50 




















Cys 


He 


Asn 


His 


Ser 
65 


Met 


Thr 


Leu 


J. o 


lie 
70 


nis 




T 1 a 

lie 


Ala 


His 


Asn 


Ala 


Ala 


Phe 


Gly 


Asn 


C\t Q 


.Lys 


nla 


Ma 4- 


Trp 


Asn 


Arg 


Trp 


Phe 










80 










O J 








an 


Gly 


Met 


Phe 


Ala 


Asn 
95 


Leu 


Pro 


lie 

X xc 


uiy 


i le 
100 


Pro 


Tyr 


Ser 


Ti- 
ne 


Ser 
105 


Phe 


Lvs 


Arg 


Tvr 


His 


Met 


Asp 


His 


nxs 


Arg 


Tyr 


Leu 


Caly 


Ala 


Asp 










110 










115 








1 £ u 


Glv 


Val 


Asp 


Val 


Asp 
125 


He 


XT A. sj 




Asp 


f ne 

1 JU 


CjIU 


biy 


Trp 


Phe 


Phe 


Cys 


Thr 


Ala 


Phe 


Arg 
140 


Lys 


Phe 


He 


T Y"r"> 


val 

14c; 
X H J 


ne 


Leu 


uin 


Pro 


T 1 

lieu 

lOU 


Phe 


Tyr 


Ala 


Phe 


Arg 


Pro 


Leu 


Phe 


T I ~ 


Asn 


Pro 


Lys 


Pro 


Ti- 
ne 


Thr 










155 










160 








165 


Tyr 


Leu 


Glu 


Val 


He 


Asn 


Thr 


Val 


r\Xcl 


uin 


vai 


rnl_ _» 

1 nr 


rne 


Asp 


lie 










170 










X / J 








1 0 U 


Leu 


He 


Tyr 


Tyr 


Phe 


Leu 


Gly 


He 


Lys 


Ser 


Leu 


Val 

v d X 


xyr 


lic I* 


Leu 










185 










190 








195 


Ala 


Ala 


Ser 


Leu 


Leu 


Gly 


Leu 


Gly 


Leu 


His 


Pro 


He 


Ser 


Gly 


His 










200 










205 








210 


Phe 


He 


Ala 


Glu 


His 
215 


Tyr 


Met 


Phe 


Leu 


Lys 
220 


Gly 


His 


Glu 


Thr 


Tyr 
225 


Ser 


Tyr 


Tyr 


Gly 


Pro 


Leu 


Asn 


Leu 


Leu 


Thr 


Phe 


Asn 


Val 


Gly 


Tyr 


His 








230 










235 








240 


Asn 


Glu 


His 


His 


Asp 


Phe 


Pro 


Asn 


He 


Pro 


Gly 


Lys 


Ser 


Leu 










245 










250 








255 


Pro 


Leu 


Val 


Arg 


Lys 


He 


Ala 


Ala 


Glu 


Tyr 


Tyr 


Asp 


Asn 


Leu 


Pro 


His 








260 










265 










270 


Tyr 


Asn 


Ser 


Trp 
275 


He 


Lys 


Val 


Leu 


Tyr 
280 


Asp 


Phe 


Val 


Met 


Asp 
285 


Asp 


Thr 


He 


Ser 


Pro 


Tyr 


Ser 


Arg 


Met 


Lys 


Arg 


His 


Gin 


Lys 


Gly 










290 










295 








300 


Glu 


Met 


Val 


Leu 


Glu 


* * * 


He 


Ser 


Leu 


Val 


Pro 


Lys 


Gly 


Phe 


Phe 










305 










310 






315 


Ser 


Lys 


Thr 


Leu 


Asp 


Asp 


Lys 


Met 


Glu 


Phe 


Leu 


His 


Tyr 


★ ★ * 


Thr 


* + * 








320 










325 








330 


Asp 


Gin 


* ★ * 


Cys 
335 


Ser 


Glu 


Ala 


Pro 


Leu 
340 


Ala 


Gin 


Phe 


Gin 


Ser 
345 


Lys 


Ser 


Ser 


Val 


He 


Pro 


Arg 


Ser 


Glu 


Ser 


Gly 


Phe 


* * * 


Thr 


Val 










350 










355 








360 
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Ser Leu Thr Leu Tyr Cys Ser Val Ser Leu Thr Gly Asn Leu *** 
365 370 375 

Leu Val Tyr Tyr Arg His *** Gly Cys Phe Thr His Val Cys His 
380 385 390 

5 Phe lie Ser He Ser Phe Lys Lys Leu Leu Lys Ser Tyr Phe Ala 

400 405 410 

Arg 

(2) INFORMATION FOR SEQ ID NO: 39: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2153526) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 





Tyr 
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Leu 


Arg 


Pro 


Leu 


Leu 


Pro 
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Leu 


Cys 


Ala 


Thr 


He 


Gly 
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25 
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Glu 
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Phe 
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Leu 


Phe 


Phe 


He 
25 
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Glu 
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Trp 
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Trp 
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Met 
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He 
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Met 
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Trp 
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Gly 


His 
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Thr 
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Lys 


Val 
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Leu 
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100 










105 




Leu 


Cys 
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Lys 
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He 
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Tyr 


Gin 
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Ser 


Lys 
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Leu 


Leu 
120 
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He 


He 
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Ser 


Leu 


Lys 
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Ser 
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Gin 
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Leu 
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140 
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Leu 
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Gin 
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Gin 
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Pro 
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Pro 
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Trp 
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Lys 
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Thr 
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Pro 


Arg 


Gin 
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Gly 
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Thr 
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Leu 
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Pro 
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Trp 
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55 (2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

60 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: amino acid (Translation of Contig 3506132) 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:40: 
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Trp 
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He 
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(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 306 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 3854933) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
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Lys 


Glu 
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Thr 
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Glu 


Phe 
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Arg 
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Glu 
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Met 
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Gly 


Leu 


Met 


Lys 
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115 
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Phe 
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120 


Leu 


Leu 


Tyr 


Leu 


Leu 
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He 


Leu 


Leu 


Leu 
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Ala 


Trp 
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Thr 


Leu 


Trp 


Val 
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Phe 
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Thr 
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Phe 
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Leu 
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Leu 
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Val 
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Ala 
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Trp 


Leu 
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Phe 
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Ser 


Thr 


Ser 


Lys 


Trp 
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Phe 
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He 
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Lys 
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Trp 


Trp 
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Ala 


Lys 
210 
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Pro 


Asn 


Cy s 




Arg 

X 3 


Lys 


Asp 


Pro 


Asp 


He 
220 
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Met 
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Pro 


Phe 
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r l ,. 
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He 
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Pro 
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Leu 


Tyr 


Phe 


Gin 


Trp 


Tyr 
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265 
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He 


Phe 


Tyr 
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Val 


He 


Gin 
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Lys 


Lys 


Trp 


Val 


Asp 
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Ala 
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Trp 


He 


Ser 


Lys 


Gin 


Glu 


Tyr 


Asp 


Glu 


Ala 


Gly 


Leu 


Pro 


Leu 


Ser 












290 










295 








300 




Thr 


Ala 


Asn 


Ala 


Ser 


Lys 





















(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 566 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: amino acid (Translation of Contig 2511785) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
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35 
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Phe 
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He 
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70 


Leu 


He 
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Met 
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Tyr 
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He 


He 


Met 


Thr 


Met 


He 
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Asn 


Trp 
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Trp 
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Tyr 


He 


Arg 


Phe 


Phe 


He 


Thr 
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105 


Tyr 


He 


Pro 


Phe 
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Tyr 
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He 


Leu 
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Ala 


Leu 


Leu 


Phe 


Leu 
120 
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Phe 


He 


Arg 


Phe 


Leu 
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Ser 
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Trp 


Phe 


Val 


Trp 


Val 


Thr 
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Met 


Asn 
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He 
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Met 
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He 
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Trp 
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Gin 
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Phe 


Asn 


Asp 


Trp 
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He 
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Pro 
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Lys 
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Leu 
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Lys 


Ser 


Leu 


Cys 


Ala 


Lys 
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He 


Glu 
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Tyr 
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Lys 
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Leu 


Leu 
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He 


He 
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Ser 
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220 
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Lys 
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Trp 
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Cys 


Arg 



-166- 



BNSDOCID:<WO 9846764A1> 











245 






Trp 


Gly Asp 


Gly 
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Pro 
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Pro 
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Met 
350 


Gly 
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Ala 


Cys 


Asp 
490 


Leu 
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Trp 
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Leu 
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(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2535) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



Val Phe Tyr Phe Gly Asn Gly Trp He Pro Thr Leu He Thr Ala 
1 5 io is 

Phe Val Leu Ala Thr Ser Gin Ala Gin Ala Gly Trp Leu Gin His 



-167- 



WO 98/46764 



20 





Asp 


Tyr 


Gly 


His 


Leu 


Ser 


Val 


Tyr 




Leu 


Val 


His 


Ly s 




v ax 


1 xe 


Gly 


5 










^n 








Asn 


TrD 


Tm 


Asn 


His 

Oj 


rg 




Phe 




lie 


Phe 


His 


Ly s 


Asp 


Pro 


Asp 


vax 


10 










a U 








Leu 


Gly 


Glu 


Trp 


Gin 

Q 


Pro 


He 


Glu 




Tyr 


Leu 


Pro 


Tyr 


Asn 

11U 


His 


Gin 


His 


15 


Pro 


Pro 


jjeu 


Leu 


T1 A 

x xe 


Pro 


Met 


Tyr 


















Thr 


Met 


He 


Val 


His 


Lys 


Asn 


Trp 












x ^ u 








Ser 


Tvr 


■l yx 


T I a 


Arg 


rne 


OK a 

rne 


lie 


20 










ICC 

ISO 








lie 


Leu 


Glv 


Ala 


Leu 

1 Tft 


L€U 


nu — 

rne 


Leu 




Ser 


His 


Trp 


Phe 


Val 

iO J 


Trp 


Val 


Thr 


25 


Glu 


He 


Asp 


Gin 


Glu 


Ala 


Tyr 


Arg 
















Thr 


Ala 


Thr 


Cys 


Asn 

X O 


Val 


Glu 


Gin 




Ser 


Gly 


His 


Leu 


Asn 


Phe 




T 1 A 

xxe 


30 










ZJU 








Met 


Pro 


Ar 5 


His 


Asn 


Leu 


His 


Liys 












Z fl 3 








Leu 


Cys 


Ala 


Lys 


His 

9 fin 


uiy 


x xe 


Glu 


35 


Ar^ 


flXcL 


Leu 


Leu 


TV _ 

ASp 


lie 


He 


Arg 




Leu 


Trp 


Leu 


Asp 


Ala 

t 3 U 


Tyr 


Leu 


His 




Asp 


Thr 


Val 


Gly 


Lys 


ui y 


Cys 


Arg 


40 


















Asp 


Gly 


Leu 


Leu 


Phe 


* + * 


Gly 


Val 












Ton 








Leu 


Leu 


Thr 


Asp 


Pro 


Met 


Leu 


Asp 


















45 


Phe 


•r lie 




Ser 


nis 


Leu 


Pro 


His 


















Leu 


Pro 


Ser 


Leu 


Ser 
365 


Arg 


HI n 


Pro 




Pro 


Pro 


Ser 


Pro 


Phe 


PhD 

irne 


oin 


biy 


50 










380 






Val 


Ala 


Leu 


Ser 


Tyr 
400 


Leu 


His 


Ser 




Asp 


Gin 


Arg 


Ser 


Met 
415 


Gly 


Leu 


Ala 


55 


Trp 


Ser 


Leu 


Gly 


He 


Thr 


Pro 


Ala 








430 








Gly 


Phe 


He 


Gly Ala 


Gly 


Pro 


Ser 












445 










Pro 


Ala 


Trp 


Leu 


His 


Ser 


Pro 


* ** 


60 










460 








Phe 


He 


Glu 


Arg 


Pro 
475 


Ala 


Leu 


Leu 




Ala 


Ala 


Arg 


Leu 


Ser 


Thr 


Arg 


Gly 
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30 


Arg 


Lys 


Pro 


Lys 


Trp 


Asn 


His 




H 0 










45 


His 


Leu 


Lys 


Gly 


Ala 


Ser 


Ala 




55 










60 


Gin 


His 


His 


Ala 


Lys 


Pro 


Asn 




70 










75 


Asn 


Met 


Leu 


His 


Val 


Phe 


Val 




85 










90 


Tyr 


Gly 


Lys 


Lys 


Lys 


Leu 


Lys 




100 










105 


Glu 


Tyr 


Phe 


Phe 


Leu 


He 


Gly 




115 










120 


Phe 


Gin 


Tyr 


Gin 


He 


He 


Met 




130 










135 


Val 


Asp 


Leu 


Ala 


Trp 


Ala 


Val 




145 










150 


Thr 


Tyr 


He 


Pro 


Phe 


Tyr 


Gly 




160 










165 


Asn 


Phe 


He 


Arg 


Phe 


Leu 


Glu 




175 










180 


Gin 


Met 


Asn 


His 


He 


Val 


Met 




190 










195 


Asp 


Trp 


Phe 


Ser 


Ser 


Gin 


Leu 




one 










210 


Ser 


Phe 


Phe 


Asn 


Asp 


Trp 


Phe 




220 










225 


GlU 


His 


His 


Leu 


Phe 


Pro 


Thr 




*1 *3 C 










240 


He 


Ala 


Pro 


Leu 


Val 


Lys 


Ser 




250 










255 


Tyr 


Gin 


Glu 


Lys 


Pro 


Leu 


Leu 




265 










270 


O A V 

oer 


Leu 


Lys 


Lys 


Ser 


Gly 


Lys 




0 0 n 










285 


Lys 




Ser 


His 


Ser 


Pro 


Arg 




295 










300 


Trp 


Gly 


Asp 


Gly 


Gin 


Arg 


Asn 




j1 0 










315 


Ser 


Glu 


Arg 


Leu 


Val 


Tyr 


Ala 














330 


T At* 

Leu 


Ser 


Pro 


Phe 


Leu 


Leu 


Ser 




34 0 










345 


C A W- 

i>er 


Thr 


Leu 


Pro 


Ser 


Trp 


Asp 




355 










360 


ber 


Ala 


Met 


Ala 


Leu 


Pro 


Val 




0. r\ 

370 










375 


Ala 


Glu 


Arg 


Trp 


Pro 


Pro 


Gly 




385 










390 


Leu 


Pro 


Leu 


Lys 


Met 


Gly 


Gly 




405 










410 


Cys 


GlU 


Ser 


Pro 


Leu 


Ala 


Ala 




420 










A 0 


Leu 


Val 


Leu 


Gin 


Met 


Leu 


Leu 




435 










440 


Arg 


Ala 


Gly 


Pro 


Leu 


Thr 


Leu 




450 










455 


Arg 


Leu 


Pro 


Leu 


Val 


His 


Pro 




465 










470 


Gin 


Ser 


Ser 


Gly 


Leu 


Pro 


Pro 




480 










485 


Leu 


Ser 


+ * + 


Asp 


Val 


Gin 


Gly 
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490 










495 










500 


Pro 


Arg 


Pro 


Ala 


Gly 


Thr 


Ala 


Ser 


Pro 


Asn 


Leu 


Gly 


Pro 


Trp 


Lys 










505 










510 








515 


Ser 


Pro 


Pro 


Pro 


His 


His 


★ * * 


Ser 


Ala 


Leu 


Thr 


Leu 


Gly 


Phe 


His 










520 










525 








530 


Gly 


Pro 


His 


Ser 


Thr 
535 


Ala 


Ser 


Pro 


Thr 


+ * + 

540 


Ala 


Cys 


Asp 


Leu 


Gly 
545 


Thr 


Lys 


Gly 


Gly 


Val 


Pro 


Arg 


Leu 


Leu 


★ * ★ 


Leu 


Ser 


Arg 


Gly 


Ser 










550 










555 






560 


Gly 


His 


Val 


Gin 


Gly 


Gly 


Ala 


Gly 


Trp 


Pro 


Gly 


Gly 


Ser 


Ala 


His 










565 










570 






575 


Pro 


Pro 


Ala 


Phe 


Pro 
580 


Gin 


Gly 


Val 


Leu 


Arg 
585 


Ser 


Lys 


He 


Leu 


Glu 
590 


Gin 


Ser 


Asp 


Pro 


Ser 


Pro 


Lys 


Ala 


Leu 


Leu 


Ser 


Ala 


Gly 


Gin 


Cys 










595 










600 








605 


Gin 


Pro 


He 


Pro 


Gly 
610 


His 


Leu 


Ala 


Pro 


Gly 
615 


Asp 


Val 


Gly 


Pro 


Xxx 
620 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 253538a) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



Gin 


Gly 


Pro 


Thr 


Pro 


Arg 


Tyr 


Phe 


Thr 


Trp 


Asp 


Glu 


Val 


Ala 


Gin 


1 








5 










10 










15 


Arg 


Ser 


Gly 


Cys 


Glu 


Glu 


Arg 


Trp 


Leu 


Val 


He 


Asp 


Arg 


Lys 


Val 










20 










25 






30 


Tyr 


Asn 


He 


Ser 


Glu 


Phe 


Thr 


Arg 


Arg 


His 


Pro 


Gly 


Gly 


Ser 


Arg 










35 










40 








45 


Val 


He 


Ser 


His 


Tyr 


Ala 


Gly 


Gin 


Asp 


Ala 


Thr 


Asp 


Pro 


Phe 


Val 










50 










55 








60 


Ala 


Phe 


His 


He 


Asn 


Lys 


Gly 


Leu 


Val 


Lys 


Lys 


Tyr 


Met 


Asn 


Ser 










65 










70 








75 


Leu 


Leu 


He 


Gly 


Glu 


Leu 


Ser 


Pro 


Glu 


Gin 


Pro 


Ser 


Phe 


Glu 


Pro 










80 










85 










90 


Thr 


Lys 


Asn 


Lys 


Glu 


Leu 


Thr 


Asp 


Glu 


Phe 


Arg 


Glu 


Leu 


Arg 


Ala 










95 










100 








105 


Thr 


Val 


Glu 


Arg 


Met 


Gly 


Leu 


Met 


Lys 


Ala 


Asn 


His 


Val 


Phe 


Phe 










110 










115 










120 


Leu 


Leu 


Tyr 


Leu 


Leu 


His 


He 


Leu 


Leu 


Leu 


Asp 


Gly 


Ala 


Ala 


Trp 










125 










130 










135 


Leu 


Thr 


Leu 


Trp 


Val 


Phe 


Gly 


Thr 


Ser 


Phe 


Leu 


Pro 


Phe 


Leu 


Leu 










140 










145 










150 


Cys 


Ala 


Val 


Leu 


Leu 


Ser 


Ala 


Val 


Gin 


Gin 


Ala 


Gin 


Ala 


Gly 


Trp 










155 










160 








165 


Leu 


Gin 


His 


Asp 


Tyr 


Gly 


His 


Leu 


Ser 


Val 


Tyr 


Arg 


Lys 


Pro 


Lys 










170 










175 






180 


Trp 


Asn 


His 


Leu 


Val 


His 


Lys 


Phe 


Val 


He 


Gly 


His 


Leu 


Lys 


Gly 










185 










190 








195 


Ala 


Ser 


Ala 


Asn 


Trp 


Trp 


Asn 


His 


Arg 


His 


Phe 


Gin 


His 


His 


Ala 










200 










205 










210 


Lys 


Pro 


Asn 


He 


Phe 


His 


Lys 


Asp 


Pro 


Asp 


Val 


Asn 


Met 


Leu 


His 



-169- 



Val Phe Val 
Lys Leu Lys 
Leu He Gly 
He He Met 
Trp Ala Val 
Phe Tyr Gly 
Phe Leu Glu 
He Val Met 
Ser Gin Leu 
Asp Trp Phe 
Phe Pro Thr 
Val Lys Ser 
Pro Leu Leu 
Ser Gly Lys 
Ser Pro Arg 
Gin Arg Asn 
Val Tyr Ala 
Leu Leu Ser 
Ser Trp Asp 
Leu Pro Val 
Pro Pro Gly 
Met Gly Gly 
Leu Ala Ala 
Met Leu Leu 
Leu Thr Leu 
Val His Pro 
Leu Pro Pro 
Val Gin Gly 
Pro Trp Lys 
Gly Phe His 
Asp Leu Gly 



215 
Leu Gly Glu 

230 
Tyr Leu Pro 

245 
Pro Pro Leu 

260 
Thr Met He 

275 
Ser Tyr Tyr 

290 
He Leu Gly 

305 
Ser His Trp 

320 
Glu He Asp 

335 
Thr Ala Thr 

350 
Ser Gly His 

365 
Met Pro Arg 

380 
Leu Cys Ala 

400 
Arg Ala Leu 

415 
Leu Trp Leu 

430 
Asp Thr Val 

445 
Asp Gly Leu 

460 
Leu Leu Thr 

475 
Phe Phe Ser 

490 
Leu Pro Ser 

505 
Pro Pro Ser 

520 
Val Ala Leu 

535 
Asp Gin Arg 

550 
Trp Ser Leu 

565 
Gly Phe He 

580 
Pro Ala Trp 

595 
Phe He Glu 

610 
Ala Ala Arg 

625 
Pro Arg Pro 

640 
Ser Pro Pro 

655 
Gly Pro His 

670 
Thr Lys Gly 



Trp Gin Pro 
Tyr Asn His 
Leu He Pro 
Val His Lys 
He Arg Phe 
Ala Leu Leu 
Phe Val Trp 
Gin Glu Ala 
Cys Asn Val 
Leu Asn Phe 
His Asn Leu 
Lys His Gly 
Leu Asp He 
Asp Ala Tyr 
Gly Lys Gly 
Leu Phe *** 
Asp Pro Met 
Ser His Leu 
Leu Ser Arg 
Pro Phe Phe 
Ser Tyr Leu 
Ser Met Gly 
Gly He Thr 
Gly Ala Gly 
Leu His Ser 
Arg Pro Ala 
Leu Ser Thr 
Ala Gly Thr 
Pro His His 
Ser Thr Ala 
Gly Val Pro 



220 






He 


Glu 


Tyr 


235 






Gin 


His 


Glu 


250 






Met 


Tyr 


Phe 


265 






Asn 


Trp 


Val 


280 






Phe 


He 


Thr 


295 






Phe 


Leu 


Asn 


310 






Val 


Thr 


Gin 


325 






Tyr 


Arg 


Asp 


340 






Glu 


Gin 


Ser 


355 






Gin 


He 


Glu 


370 






His 


Lys 


He 


385 






He 


Glu 


Tyr 


405 






He 


Arg 


Ser 


420 






Leu 


His 


Lys 


435 






Cys 


Arg 


Trp 


450 






Gly 


Val 


Ser 


465 






Leu 


Asp 


Leu 


480 






Pro 


His 


Ser 


495 






Gin 


Pro 


Ser 


510 






Gin 


Gly 


Ala 


525 






His 


Ser 


Leu 


540 






Leu 


Ala 


Cys 


555 






Pro 


Ala 


Leu 


570 






Pro 


Ser 


Arg 


585 






Pro 


+ * * 


Arg 


600 






Leu 


Leu 


Gin 


bio 






Arg 


Gly 


Leu 


630 






Ala 


Ser 


Pro 


645 






* + * 


Ser 


Ala 


660 






Ser 


Pro 


Thr 


67 5 






Arg 


Leu 


Leu 







225 


Gly 


Lys 


Lys 






240 


Tyr 


Phe 


Phe 






255 


Gin 


Tyr 


Gin 






270 


Asp 


Leu 


Ala 






285 


Tyr 


He 


Pro 






300 


Phe 


He 


Arg 






315 


Met 


Asn 


His 






330 


Trp 


Phe 


Ser 






345 


Phe 


Phe 


Asn 






360 


His 


His 


Leu 






375 


Ala 


Pro 


Leu 






390 


Gin 


Glu 


Lys 






410 


Leu 


Lys 


Lys 






425 


* *■ + 


Ser 


His 






440 


Gly 


Asp 


Gly 






455 


Glu 


Arg 


Leu 






470 


Ser 


Pro 


Phe 






485 


Thr 


Leu 


Pro 






500 


Ala 


Met 


Ala 






515 


Glu 


Arg 


Trp 






530 


Pro 


Leu 


Lys 






545 


Glu 


Ser 


Pro 






560 


Val 


Leu 


Gin 






575 


Ala 


Gly 


Pro 






590 


Leu 


Pro 


Leu 






605 


Ser 


Ser 


Gly 






620 


Ser 


* + * 


Asp 






635 


Asn 


Leu 


Gly 






650 


Leu 


Thr 


Leu 






665 


* * * 


Ala 


Cys 






680 


* * + 


Leu 


Ser 



-170- 



WO 98/46764 



PCT/US98/07421 



10 



20 



50 



55 



60 



65 











685 










690 










695 


Arg 


Gly 


Ser 


Gly 


His 
700 


Val 


Gin 


Gly 


Gly 


Ala 
705 


Gly 


Trp 


Pro 


Gly 


Gly 
710 


Ser 


Ala 


His 


Pro 


Pro 
715 


Ala 


Phe 


Pro 


Gin 


Gly 
720 


Val 


Leu 


Arg 


Ser 


Lys 
725 


lie 


Leu 


Glu 


Gin 


Ser 
730 


Asp 


Pro 


Ser 


Pro 


Lys 
735 


Ala 


Leu 


Leu 


Ser 


Ala 
740 


Gly 


Gin 


Cys 


Gin 


Pro 


He 


Pro 


Gly 


His 


Leu 


Ala 


Pro 


Gly 


Asp 


Val 










745 










750 






755 


Gly 


Pro 


Xxx 



























(2) INFORMATION FOR SEQ ID NO: 45: 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 6 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: nucleic acid 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



25 CGTATGTCAC TCCATTCCAA ACTCGTTCAT GGTATCATAA ATATCAACAC ATTTACGCTC 60 

CACTCCTCTA TGGTATTTAC ACACTCAAAT ATCGTACTCA AGATTGGGAA GCTTTTGTAA 120 

AGGATGGTAA AAATGGTGCA ATTCGTGTTA GTGTCGCCAC AAATTTCGAT AAGGCCGCTT 180 

ACGTCATTGG TAAATTGTCT TTTGTTTTCT TCCGTTTCAT CCTTCCACTC CGTTATCATA 240 

GCTTTACAGA TTTAATTTGT TATTTCCTCA TTGCTGAATT CGTCTTTGGT TGGTATCTCA 300 

30 CAATTAATTT CCAAGTTAGT CATGTCGCTG AAGATCTCAA ATTCTTTGCT ACCCCTGAAA 360 

GACCAGATGA ACCATCT CAA AT CAATGAAG ATTGGGCAAT CCTTCAACTT AAAACT ACT C 420 

AAGATTATGG TCATGGTTCA CTCCTTTGTA CCTTTTTTAG TGGTTCTTTA AATCATCAAG 4 80 

TTGTTCATCA TTTATTCCCA TCAATTGCTC AAGATTTCTA CCCACAACTT GTACCAATTG 540 

TAAAAGAAGT TTGTAAAGAA CATAACATTA CTTACCACAT TAAACCAAAC TTCACTGAAG 600 

35 CTATTATGTC ACACATTAAT TACCTTTACA AAATGGGTAA TGATCCAGAT TATGTTAAAA 660 

AACCATTAGC CTCAAAAGAT GATTAAATGA AATAACTTAA AAACCAATTA TTTACTTTTG 720 

ACAAACAGTA ATATTAATAA ATACAA 74 6 

40 (2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 

45 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



Tyr 


Val 


Thr 


Pro 


Phe 


Gin 


Thr 


Arg 


Ser 


Trp 


Tyr 


His 


Lys 


Tyr 


Gin 


1 








5 










10 






15 


His 


He 


Tyr 


Ala 


Pro 


Leu 


Leu 


Tyr 


Gly 


He 


Tyr 


Thr 


Leu 


Lys 


Tyr 










20 










25 








30 


Arg 


Thr 


Gin 


Asp 


Trp 


Glu 


Ala 


Phe 


Val 


Lys 


Asp 


Gly 


Lys 


Asn Gly 


Ala 








35 










40 










45 


He 


Arg 


Val 


Ser 


Val 


Ala 


Thr 


Asn 


Phe 


Asp 


Lys 


Ala 


Ala 


Tyr 


Val 








50 










55 










60 


He 


Gly 


Lys 


Leu 


Ser 


Phe 


Val 


Phe 


Phe 


Arg 


Phe 


He 


Leu 


Pro 










65 










70 








75 


Leu 


Arg 


Tyr 


His 


Ser 


Phe 


Thr 


Asp 


Leu 


He 


Cys 


Tyr 


Phe 


Leu 


He 










80 










85 






90 


Ala 


Glu 


Ph 


Val 


Phe 
95 


Gly 


Trp 


Tyr 


Leu 


Thr 
100 


He 


Asn 


Phe 


Gin 


Val 
105 
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10 



15 



50 



60 



65 



Ser 


His 


Val 


Ala 


Glu 


Asp 


Leu 


Lys 


Phe 


Phe 


Ala 


Thr 


Pro 


Glu 


Arg 










110 










115 










120 


Pro 


Asp 


Glu 


Pro 


Ser 


Gin 


lie 


Asn 


Glu 


Asp 


Trp 


Ala 


He 


Leu 


Gin 










125 










130 










135 


Leu 


Lys 


Thr 


Thr 


Gin 


Asp 


Tyr 


Gly 


His 


Gly 


Ser 


Leu 


Leu 


Cys 


Thr 










140 










145 










150 


Phe 


Phe 


Ser 


Gly 


Ser 


Leu 


Asn 


His 


Gin 


Val 


Val 


His 


His 


Leu 


Phe 










155 










160 










165 


Pro 


Ser 


lie 


Ala 


Gin 


Asp 


Phe 


Tyr 


Pro 


Gin 


Leu 


Val 


Pro 


He 


Val 










170 










175 










180 


Lys 


Glu 


Val 


Cys 


Lys 


Glu 


His 


Asn 


He 


Thr 


Tyr 


His 


He 


Lys 


Pro 










185 










190 






195 


Asn 


Phe 


Thr 


Glu 


Ala 


lie 


Met 


Ser 


His 


He 


Asn 


Tyr 


Leu 


Tyr 


Lys 










200 










205 








210 


Met 


Gly 


Asn 


Asp 


Pro 


Asp 


Tyr 


Val 


Lys 


Lys 


Pro 


Leu 


Ala 


Ser 


Lys 










215 










220 










225 


Asp 


Asp 


* * * 



























20 (2) INFORMATION FOR SEQ ID NO 47: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 nucleic acids 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

30 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 47: 

TTTTGGAAGG NTCCAAGTTN ACCACGGANT NGGCAAGTTN ACGGGGCGGA AANCGGTTTT 60 

CCCCCCAAGC CTTTTGTCGA CTGGTTCTGT GGTGGCTTCC AGTACCAAGT CGACCACCAC 120 

35 TTATTCCCCA GCCTGCCCCG ACACAATCTG GCCAAGACAC ACGCACTGGT CGAATCGTTC 180 

TGCAAGGAGT GGGGTGTCCA GTACCACGAA GCCGACCTCG TGGACGGGAC CATGGAAGTC 24 0 

TTGCACCATT TGGGCAGCGT GGCCGGCGAA TTCGTCGTGG ATTTTGTACG CGACGGACCC 300 

GCCATGTAAT CGTCGTTCGT GACGATGCAA GGGTTCACGC ACATCTACAC ACACTCACTC 360 

ACACAACTAG TGTAACTCGT ATAGAATTCG GTGTCGACCT GGACCTTGTT TGACTGGTTG 420 

40 GGGATAGGGT AGGTAGGCGG ACGCGTGGGT CGNCCCCGGG AATTCTGTGA CCGGTACCTG 480 

GCCCGCGTNA AAGT 494 



45 (2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



Phe Trp Lys Xxx Pro Ser Xxx Pro Arg Xxx Xxx Gin Val Xxx Gly 
1 5 10 15 

Ala Glu Xxx Gly Phe Pro Pro Lys Pro Phe Val Asp Trp Phe Cys 

20 25 30 

Gly Gly Phe Gin Tyr Gin Val Asp His His Leu Phe Pro Ser Leu 

35 40 45 

Pro Arg His Asn L u Ala Lys Thr His Ala Leu Val Glu Ser Phe 

50 55 60 

Cys Lys Glu Trp Gly Val Gin Tyr His Glu Ala Asp Leu Val Asp 

65 70 75 
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Gly Thr Met Glu Val Leu His His Leu Gly Ser Val Ala Gly Glu 
65 70 75 

Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met 
80 85 



10 



20 



55 



60 



65 



(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 520 nucleic acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



GGATGGAGTT CGTCTGGATC GCTGTGCGCT ACGCGACGTG GTTTAAGCGT CATGGGTGCG 60 

25 CTTGGGTACA CGCCGGGGCA GTCGTTGGGC ATGTACTTGT GCGCCTTTGG TCTCGGCTGC 120 

ATTTACATTT TTCTGCAGTT CGCCGTAAGT CACACCCATT TGCCCGTGAG CAACCCGGAG 180 

GATCAGCTGC ATTGGCTCGA GTACGCGCGG ACCACACTGT GAACATCAGC ACCAAGTCGT 240 

GGTTTGTCAC ATGGTGGATG TCGAACCTCA ACTTTCAGAT CGAGCACCAC CTTTTCCCCA -300 

CGGCGCCCCA GTTCCGTTTC AAGG AG AT C A GCCCGCGCGT CGAGGCCCTC TTCAAGCGCC 360 

30 ACGGTCTCCC TTACTACGAC ATGCCCTACA CGAGCGCCGT CTCCACCACC TTTGCCAACC 420 

TCTACTCCGT CGGCCATTCC GTCGGCGACG CCAAGCGCGA CTAGCCTCTT TTCCTAGACC 4 80 

TTAATTCCCC ACCCCACCCC ATGTTCTGTC TTCCTCCCGC 520 

35 (2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



50 



Met 


Glu 


Phe 


Val 


Trp 


He 


Ala 


Val 


Arg 


Tyr 


Ala 


Thr 


Trp Phe Lys 


1 








5 










10 






15 


Arg 


His 


Gly 


Cys 


Ala 


Trp 


Val 


His 


Ala 


Gly 


Ala 


Val 


Val Gly His 










20 










25 






30 


Val 


Leu 


Val 


Arg 


Leu 


Trp 


Ser 


Arg 


Leu 


His 


Leu 


His 


Phe Ser Ala 










35 










40 






45 


Val 


Arg 


Arg 


Lys 


Ser 


His 


Pro 


Phe 


Ala 


Arg 


Glu 


Gin 


Pro Gly Gly 










50 










55 






60 


Ser 


Ala 


Ala 


Leu 


Ala 


Arg 


Val 


Arg 


Ala 


Asp 


His 


Thr 


Val Asn He 










65 










70 






75 


Ser 


Thr 


Lys 


Ser 


Trp 


Phe 


Val 


Thr 


Trp 


Trp 


Met 


Ser 


Asn Leu Asn 










80 










85 






90 


Phe 


Gin 


He 


Glu 


His 


His 


Leu 


Phe 


Pro 


Thr 


Ala 


Pro 


Gin Phe Arg 


Ph 








95 










100 






105 


Lys 


Glu 


He 


S r 


Pro 


Arg 


Val 


Glu 


Ala 


Leu 


Phe 


Lys Arg His 


Gly 








110 










115 






120 


Leu 


Pro 


Tyr 


Tyr 


Asp 


Met 


Pro 


Tyr 


Thr 


Ser 


Ala 


Val Ser Thr 










125 










130 






135 


Thr 


Phe 


Ala 


Asn 


Leu 


Tyr 


Ser 


Val 


Gly 


His 


Ser 


Val 


Gly Asp Ala 
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140 145 150 

Lys Arg Asp 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 nucleic acids 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



15 



30 



35 



40 



45 



50 



55 



60 



(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



ACGCGTCCGC CCACGCGTCC GCCGCGAGCA ACTCATCAAG GAAGGCTACT TTGACCCCTC 60 

20 GCTCCCGCAC ATGACGTACC GCGTGGTCGA GATTGTTGTT CTCTTCGTGC TTTCCTTTTG 120 

GCTGATGGGT CAGTCTTCAC CCCTCGCGCT CGCTCTCGGC ATTGTCGTCA GCGGCATCTC 180 

TCAGGGTCGC TGCGGCTGGG TAATGCATGA GATGGGCCAT GGGTCGTTCA CTGGTGTCAT 240 

TTGGCTTGAC GACCGGTTGT GCGAGTTCTT TTACGGCGTT GGTTGTGGCA TGAGCGGTCA 300 

TTACTGGAAA AACCAGCACA GCAAACACCA CGCAGCGCCA AACCGGCTCG AGCACGATGT 360 

25 AGATCTCAAC ACCTTGCCAT TGGTGGCCTT CAACGAGCGC GTCGTGCGCA AGGTCCGACC 420 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:. 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



Arg 


Val 
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Gly 


He 


Val 


Val 


Ser 


Gly 


He 


Ser 
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Gly 


Arg 
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Gly 
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Val 


Met 


His 


Glu 


Met 


Gly 


His 


Gly 


Ser 










65 










70 








75 


Phe 
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Gly 


Val 
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Trp 
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Asp 


Asp 


Arg 
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Phe 


Phe 
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Gly 


Val 


Gly 
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Met 
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Gly 
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His 








80 










85 








90 


Ser 


Lys 


His 


His 


Ala 
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Val 
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Val 
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What is claimed is : 

1 . A nucleic acid construct comprising: 

One or more nucleotide sequences depicted in a SEQ ID NO: selected from 
the group consisting of SEQ ID NO:l, SEQ ID NO:3 and SEQ ID NO:5, 
5 wherein said one or more nucleotide sequences is linked to a heterologous 

nucleotide sequence. 

2. A nucleic acid construct comprising: 

One or more nucleotide sequences depicted in a SEQ ID NO: selected from 
10 the group consisting of SEQ ID NO:l, SEQ ID NO:3 and SEQ ID NO:5, 

wherein said one or more nucleotide sequences is operably associated with an 
expression control sequence functional in a plant cell. 

3. The nucleic acid construct according to claim 2, wherein said nucleotide 
1 5 sequence has an average A + T content of less than about 60%. 

4. The nucleic acid construct according to claim 2, wherein said nucleotide 
sequence is derived from a fungus. 



20 5. The nucleic acid construct according to claim 4, wherein said fungus is of 

the genus Mortierella. 



6. The nucleic acid construct according to claim 5, wherein said fungus is of 
the species alpina. 

25 

7. A nucleic acid construct comprising: 

A nucleotide sequence which encodes a polypeptide comprising an amino 
acid sequence depicted in SEQ ID NO:2, wherein said nucleotide sequence is 
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operably associated with a transcription or an expression control sequence 
function in a plant cell, wherein said nucleotide sequence encodes a functionally 
active polypeptide which desaturates a fatty acid molecule at carbon 6 from the 
carboxyl end of said fatty acid molecule. 

5 

8. A nucleic acid construct comprising: 

A nucleotide sequence which encodes a polypeptide comprising an 
amino acid sequence depicted in SEQ ID NO:4, wherein said nucleotide 
sequence is operably associated with a transcription or an expression control 
10 sequence functional in a plant cell, wherein said nucleotide sequence encodes a 

functionally active polypeptide which desaturates a fatty acid molecule at 
carbon 12 from the carboxyl end of said fatty acid molecule. 



9. A nucleic acid construct comprising: 

15 A nucleotide sequence which encodes a polypeptide comprising an 

amino acid sequence depicted in SEQ ID NO:6, wherein said nucleotide 
sequence is operably associated with a transcription or an expression control 
sequence function in a plant cell, wherein said nucleotide sequence encodes a 
functionally active polypeptide which desaturates a fatty acid molecule at 

20 carbon 5 from the carboxyl end of said fatty acid moleculle. 



10. A nucleic acid construct comprising: 

at least one nucleotide sequence which encodes a functionally active 
desaturase having an amino acid sequence depicted in a SEQ ID NO: selected 
25 from the group consisting of SEQ ID NO:2, SEQ ID NO:4 and SEQ ED NO:6, 

wherein said nucleotide sequence is operably associated with a promoter 
functional in a plant cell. 
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11. The nucleic acid construct according to claim 10, wherein said plant cell is a 
seed cell. 



12. The nucleic acid construct according to claim 11, wherein said seed cell is 
5 an embryo cell. 

13. A recombinant plant cell comprising: 

At least one copy of a DNA sequence which encodes at least one 
functionally active Mortierella alpina fatty acid desaturase which results in the 

10 production of a polyunsaturated fatty acid, wherein said fatty acid desaturase 

has an amino acid sequence as depicted in a SEQ ID NO: selected from the 
group consisting of SEQ ID NO:2, SEQ ID NO:4, and SEQ ID NO:6, wherein 
said cell was transformed with a vector comprising said DNA sequence, and 
wherein said DNA sequence is operably associated with an expression control 

15 sequence. 



20 



14. The recombinant plant cell of claim 13, wherein said polyunsaturated fatty 
acid is selected from the group consisting of LA, ARA, GLA, DGLA, SDA 
and EPA. 



15. The recombinant plant cell of claim 13, wherein said recombinant plant cell 
is enriched in a fatty acid selected from the group consisting of 18: 1, 18:2, 
18:3 and 18:4. 



25 16. The recombinant plant cell of claim 15, wherein said plant cell is selected 

from the group consisting of Brassica, soybean, safflower, corn, flax, and 
sunflower. 



-177- 



BNSDOCID: <WO 9846764A1> 



WO 98/46764 



PCT/US98/07421 



17. The recombinant plant cell according to claim 16, wherein said expression 
control sequence is endogenous to said plant cell. 



18. One or more plant oils expressed by said recombinant plant cell of claim 16. 

5 

19. A method for obtaining altered long chain polyunsaturated fatty acid 
biosynthesis comprising the steps of: 

growing a plant having cells which contain a transgene encoding a 
transgene expression product which desaturates a fatty acid molecule at carbon 
10 5 from the carboxyl end of said fatty acid molecule, wherein said transgene is 

operably associated with an expression control sequence, under conditions 
whereby said transgene is expressed, whereby long chain polyunsaturated fatty 
acid biosynthesis in said cells is altered. 



15 20. A method for obtaining altered long chain polyunsaturated fatty acid 

biosynthesis comprising the steps of: 

growing a plant having cells which contain one or more transgenes, 
derived from a fungus or algae, which encodes a transgene expression product 
which desaturates a fatty acid molecule at a carbon selected from the group 
20 consisting of carbon 5, carbon 6 and carbon 12 from the carboxyl end of said 

fatty acid molecule, wherein said one or more transgenes is operably associated 
with an expression control sequence, under conditions whereby said one or 
more transgenes is expressed, whereby long chain polyunsaturated fatty acid 
biosynthesis in said cells is altered. 

25 

21. The method according to claims 19 or 20, wherein said long chain 
polyunsaturated fatty acid is selected from the group consisting of LA, ARA, 
GLA, DGLA, SDA and EPA. 
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22. A plant oil or fraction thereof produced according to the method of claims 
19 or 20. 

23. A method of treating or preventing malnutrition comprising administering 

5 said plant oil of claim 22 to a patient in need of said treatment or prevention 

in an amount sufficient to effect said treatment or prevention. 

24. A pharmaceutical composition comprising said plant oil or fraction of claim 
22 and a pharmaceutically acceptable carrier. 

10 

25. The pharmaceutical composition of claim 24, wherein said pharmaceutical 
composition is in the form of a solid or a liquid. 

26. The pharmaceutical composition of claim 25, wherein said pharmaceutical 
15 composition is in a capsule or tablet form. 

27. The pharmaceutical composition of claim 24 further comprising at least one 
nutrient selected from the group consisting of a vitamin, a mineral, a 
carbohydrate, a sugar, an amino acid, a free fatty acid, a phospholipid, an 

20 antioxidant, and a phenolic compound. 

28. A nutritional formula comprising said plant oil or fraction thereof of claim 
22. 

25 29. The nutritional formula of claim 28, wherein said nutritional formula is 

selected from the group consisting of an infant formula, a dietary 
supplement, and a dietary substitute. 
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30. The nutritional formula of claim 29, wherein said infant formula, dietary 
supplement or dietary supplement is in the form of a liquid or a solid. 

3 1 . An infant formula comprising said plant oil or fraction thereof of claim 22. 

5 

32. The infant formula of claim 3 1 further comprising at least one macronutrient 
selected from the group consisting of coconut oil, soy oil, canola oil, mono- 
and diglycerides, glucose, edible lactose, electrodialysed whey, 
electrodialysed skim milk, milk whey, soy protein, and other protein 

10 hydrolysates. 

33. The infant formula of claim 32 further comprising at least one vitamin 
selected from the group consisting of Vitamins A, C, D, E, and B complex; 
and at least one mineral selected from the group consisting of calcium, 

15 magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 

chloride, iodine, selenium, and iron. 

34. A dietary supplement comprising said plant oil or fraction thereof of claim 
22. 

20 

35. The dietary supplement of claim 34 further comprising at least one 
macronutrient selected from the group consisting of coconut oil, soy oil, 
canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
whey, electrodialysed skim milk, milk whey, soy protein, and other protein 

25 hydrolysates. 

36. The dietary supplement of claim 35 further comprising at least one vitamin 
selected from the group consisting of Vitamins A, C, D, E, and B complex; 
and at least one mineral selected from the group consisting of calcium, 
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magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

37. The dietary supplement of claim 34 or claim 36, wherein said dietary 
5 supplement is administered to a human or an animal. 

38. A dietary substitute comprising said plant oil or fraction thereof of claim 22. 

39. The dietary substitute of claim 38 further comprising at least one 

10 macronutrient selected from the group consisting of coconut oil, soy oil, 

canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
whey, electrodialysed skim milk, milk whey, soy protein, and other protein 
hydrolysates. 

40. The dietary substitute of claim 39 further comprising at least one vitamin 
selected from the group consisting of Vitamins A, C, D, E, and B complex; 
and at least one mineral selected from the group consisting of calcium, 
magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

41. The dietary substitute of claim 38 or claim 40, wherein said dietary 
substitute is administered to a human or animal. 

42. A method of treating a patient having a condition caused by insuffient 

25 intake or production of polyunsaturated fatty acids comprising administering 

to said patient said dietary substitute of claim 38 or said dietary supplement 
of claim 34 in an amount sufficient to effect said treatment. 
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43. The method of claim 42, wherein said dietary substitute or said dietary 
supplement is administered enterally or parenterally. 

44. A cosmetic comprising said plant oil or fraction thereof of claim 22. 

5 

45. The cosmetic of claim 44, wherein said cosmetic is applied topically. 

46. The pharmaceutical composition of claim 24, wherein said pharmaceutical 
composition is administered to a human or an animal. 

10 

47. An animal feed comprising said plant oil or fraction thereof of claim 22. 

48. An isolated nucleotide sequence comprising the nucleotide sequence 
selected from the group consisting of SEQ ID NO:38 - SEQ ID NO:44 

15 wherein said nucleotide sequence is expressed in a plant cell. 

49. The method of claim 20 wherein said fungus is Mortierella species. 

50. The method of claim 49 wherein said fungus is Mortierella alpina. 

20 

51. An isolated nucleotide sequence selected from the group consisting of SEQ 
ID NO:49 - SEQ ID NO:50 wherein said sequence is expressed in a plant 
cell. 
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Recombinant plant cells comprising said constructs. 

Methods for obtaining altered long chain polyunsaturated fatty acid biosyn- 
thesis using plants comprising delta-5, delta-6, or delta-12 desaturases, or 
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Plant oils derived from said plants and their use for therapeutical, 
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